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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

5 

(i) APPLICANT: 

(A) NAME: 2ENECA LIMITED 

(B) STREET: 15 STANHOPE GATE 

(C) CITY: LONDON 

10 (E) COUNTRY: UNITED KINGDOM 

(F) POSTAL CODE (ZIP): W1Y 6LN 
<G) TELEPHONE: 0171 304 5000 
O < H > TELEFAX: 0171 304 5151 



if? 



(I) TELEX: 0171 834 2042 
(ii) TITLE OF INVENTION: PROTEINS 



(iii) NUMBER OF SEQUENCES: 131 

P 

%20 (iv) COMPUTER READABLE FORM: 



25 



(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC- DOS /MS-DOS 

■(D) SOFTWARE: Patentln Release U.0, Version #1.30 (EPO) 



(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: , 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 



GGAAGCTTGA AGATGGATAC AGTTGGTGCA GC 



32 



5 (2) INFORMATION FOR SEQ ID NO: 2: 



10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 
M (C) STRANDEDNESS: single 
tig (D) TOPOLOGY: linear * 

(1 15 



in 



(ii) MOLECULE TYPE: other nucleic acid 



Si 20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
GGAAGCTTAG ACAGATGGGG GTGTCGTTTT G 
25 (2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 amino acids 

(B) TYPE: amino acid 

30 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



31 



35 




5 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Glu Val Gin Leu Gin Gin Ser Gly Ala Glu Leu Val Arg Ser Gly Ala 
1 5 10 15 

Ser Val Lys Leu Ser Cys Thr Ala Ser Gly Phe Asn He Lys Asp Asn 
20 25 30 

Tyr Met 



10 



(2) INFORMATION FOR SEQ ID NO: 4: 

<i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

20 (ii) MOLECULE TYPE: other nucleic acid 



ZD (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

GACATTCAGC TGACCCAGTC TCCA 
<2) INFORMATION FOR SEQ ID NO: 5: 

30 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
35 (D) TOPOLOGY: linear 



Cii) MOLECULE TYPE: other nucleic acid 
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<xi> SEQUENCE DESCRIPTION: SEQ ID NO: 5 

GACATTGAGC TCACCCAGTC TCCA 

5 (2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 22 base pairs 
<B> TYPE: nucleic acid 
10 <C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

15 



(xil SEQUENCE DESCRIPTION: SEQ ID NO: 6 
20 AGGTSMARCT GCAGSAGTCW GG 

(2) INFORMATION FOR SEQ ID NO: 7: 

25 

(ii SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
30 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



35 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 



ACTAGTGGAA TTCAGTGTGA GGTSCARCTG CAGCARTCWG G 



41 



5 (2) INFORMATION FOR SEQ ID NO: 8: 



10 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 357 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



15 



(ii) MOLECULE TYPE: other nucleic acid 



txi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 



20 GACATTGAGC TCACCCAGTC TCCAGCAATC ATGTCTGCAT CTCCAGGGGA GAAGGTCACC 



60 



ATAACCTGCA GTGCCAGCTC AAGTGTAACT TACATGCACT GGTTCCAGCA GAAGCCAGGC 120 



ACTTCTCCCA AACTCTGGAT TTATAGCACA TCCAACCTGG CTTCTGGAGT CCCTGCTCGC 180 



25 



TTCAGTGGCA GTGGATCTGG GACCTCTTAC TCTCTCACAA TCAGCCGAAT GGAGGCTGAA 24 0 



GATGCTGCCA CTTATTACTG CCAGCAAAGG AGTACTTACC CGCTCACGTT CGGTGCTGGG 300 



30 ACCAAGCTGG AGCTGAAACG GGCTGATGCT GCACCAACTG TATCCATCTT CAAGCTT 



357 



(2) INFORMATION FOR SEQ ID NO: 9: 



35 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 108 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



Ui) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Asp lie Glu Leu Thr Gin Ser Pro Ala U e Met Ser Ala Ser pro Gly 

Glu Lys val Thr He Thr Cys Ser. Ala Ser Ser Ser Val Thr Tyr Met 
20 25 30 

His Trp Phe Gin Gin Lys Pro Gly Thr Ser Pro Lys Leu Trp He Tyr 
35 40 45 



Ser Thr Ser Asn Leu Ala Ser Gly Val Pro Ala Arg Phe Ser Gly Ser 

60 



50 55 



Gly Ser Gly Thr Ser Tyr Ser Leu Thr lie Ser Arg Met Glu Ala Glu 
" 70 ^ 80 

Asp Ala Ala Thr Tyr Tyr Cys Gin Gin Arg Ser Thr Tyr Pro Leu Thr 
85 90 95 

Phe Gly Ala Gly Thr Lys Leu Glu Leu Lys Arg Ala 
100 105 

(2) INFORMATION FOR SEQ ID NO: 10: 

!i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 360 base pairs 
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<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

5 (ii) MOLECULE TYPE: other nucleic acid 



10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10; 

GAGGTGCAGC TGCAGCARTC WGGGGCAGAG CTTGTGAGGT CAGGGGCCTC AGTCAAGTTG 60 
Jp TCCTGCACAG CTTCTGGCTT CAACATTAAA GACAACTATA TGCACTGGGT GAAGCAGAGG 120 

& 15 

p . CCTGAACAGG GCCTGGAGTG GATTGCATGG ATTGATCCTG AGAATGGTGA TACTGAATAT 180 

in 

GCCCCGAAGT TCCGGGGCAA GGCCACTTTG ACTGCAGACT CATCCTCCAA CACAGCCTAC 24 0 

3 

M 20 CTGCACCTCA GCAGCCTGAC ATCTGAGGAC ACTGCCGTCT ATTACTGTCA TGTCCTGATC 300 
W TATGCTGGTT ATfTGGCTAT GGACTACTGG GGTCAAGGAA CCTCAGTCGC CGTCTCCTCA 360 

s=£s 

25 (2) INFORMATION FOR SEQ ID NO: 11: 

U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 120 amino acids 

(B) TYPE: amino acid 

30 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



35 



f4i 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Glu Val Gin Leu Gin Gin Ser Gly Ala Glu Leu Val Arg Ser Gly Ala 
15 10 15 



Ser Val Lys Leu Ser Cys Thr Ala Ser Gly Phe Asn 



20 



25 



lie Lys Asp Asn 
30 



Tyr Met His Trp Val Lys Gin Arg Pro Glu Gin Gly Leu Glu Trp lie 
35 40 



45 



Ala Trp lie Asp Pro Glu Asn Gly Asp Thr Glu Tyr Ala Pro Lys Phe 
50 55 



60 



Arg Gly Lys Ala Thr Leu Thr Ala Asp Ser Ser Ser Asn Thr Ala Tyr 
65 70 



75 



80 



Leu His Leu Ser Ser Leu Thr Ser Glu 



Asp Thr Ala Val Tyr Tyr Cys 
85 90 95 



His Val Leu lie Tyr Ala Gly Tyr Leu Ala Met Asp Tyr Trp Gly Gin 
100 105 



110 



Gly Thr Ser Val Ala Val Ser Ser 
"5 ' 120 



INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 39 base pair 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: other nucleic acid 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
AAGCTTTCCC GCGGGGACAT TGAGCTCACC CAGTCTCCA 
5 (2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 
10 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

q (ii) MOLECULE TYPE: other nucleic acid 

IK. 

fi 15 

f=| 

in 

%Q ' < xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

^ 20 AAGCTTCTCG AGCTTGGTCC CAGCACCGAA 

ry 

S s I 

< 2 > INFORMATION FOR SEQ ID NO: 14: 

25 

<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 
30 (C) STRANDEDNESS : single 

CD) TOPOLOGY: linear 

<ii) MOLECULE TYPE: other nucleic acid 
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39 



30 



35 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 
AAGCTTGGAA TTCAGTGTGA GGTGCAGCTG CAGCAG 
5 (2) INFORMATION FOR SEQ ID NO: 15: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



10 



(ii) MOLECULE TYPE: other nucleic acid 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15 

20 AAGCTTCGAG CTCACGGCGA CTGAGGTTCC TTG 

(2) INFORMATION FOR SEQ ID NO: 16: 

'. i; SEQUENCE CHARACTERISTICS: 
25 iA) LENGTH: 705 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



30 (ii) MOLECULE TYPE: other nuclei 



c acid 



35 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
ATGGATTTTC AAGTGCAGAT TTTCAGCTTC CTGCTAATCA GTGCTTCAGT CATAATGTCC 60 
5 CGCGGGGACA TTGAGCTCAC CCAGTCTCCA GCAATCATGT CTGCATCTCC AGGGGAGAAG 120 
GTCACCATAA CCTGCAGTGC CAGCTCAAGT GTAACTTACA TGCACTGGTT CCAGCAGAAG 160 
CCAGGCACTT CTCCCAAACT CTGGATTTAT AGCACATCCA ACCTGGCTTC TGGAGTCCCT 
GCTCGCTTCA GfGGCAGTGG ATCTGGGACC TCTTACTCTC TCACAATCAG CCGAATGGAG 



ACGCTGAGCA AAGCAGACTA CGAGAAACAC AAAGTCTACG CCTGCGAAGT CACCCATCAG 
25 GGCCTGAGTT CGCCCGTCAC AAAGAGCTTC AACAGGGGAG AGTGT 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 
30 <A> LENGTH: 235 amino acids 

(B) 'TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



35 (ii) MOLECULE TYPE: protein 



240 



300 



360 



GCTGAAGATG CTGCCACTTA TTACTGCCAG CAAAGGAGTA CTTACCCGCT CACGTTCGGT 
15 GCTGGGACCA AGCTCGAGAT CAAACGGACT GTGGCTGCAC CATCTGTCTT CATCTTCCCG 420 
CCAT CTGATG AGCAGTTGAA ATCTGGAACT GCCTCTGTTG TGTGCCTGCT GAATAACTTC 
TATCCCAGAG AGGCCAAAGT ACAGTGGAAG GTGGATAACG CCCTCCAATC GGGTAACTCC 

20 

CAGGAGAGTG TCACAGAGCA GGACAG CAAG GACAGCACCT ACAGCCTCAG CAGCACCCTG 



480 



540 



600 



660 



■705 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Met Asp Phe Gin Val Gin lie Phe Ser Phe Leu Leu lie Ser Ala Ser 
1 5 10 is 

Val He Met Ser Arg Gly Asp He Glu Leu Thr Gin Ser Pro Ala lie 
20 25 30 

Met Ser Ala Ser Pro Gly Glu Lys Val Thr He Thr Cys Ser Ala Ser 
35 40 45 

Ser Ser Val Thr Tyr Met His Trp Phe Gin Gin Lys Pro Gly Thr Ser 
50 55 60 

Pro Lys Leu Trp He Tyr Ser Thr Ser Asn Leu Ala Ser Gly Val Pro 
65 • ™ 75 eo 

Ala Arg Phe Ser Gly Ser Gly Ser Gly Thr Ser Tyr Ser Leu Thr He 
85 go 95 

Ser Arg Met Glu Ala Glu Asp Ala Ala Thr Tyr Tyr Cys Gin Gin Arg 
100 105 no 

Ser Thr Tyr Pro Leu Thr Phe Gly Ala Gly Thr Lys Leu Glu lie Lys 
115 120 125 

Arg Thr Val Ala Ala Pro Ser Val Phe He Phe Pro Pro Ser Asp Glu 
130 135 i 4 o 



Gin Leu Lys Ser Gly Thr Ala Ser Val Val Cys Leu Leu Asn Asn Phe 
145 150 155 160 



Tyr Pro Arg Glu Ala Lys Val Gin Trp Lys Val Asp Asn Ala Leu Gin 
165 170 175 

Ser Gly Asn Ser Gin Glu Ser Val Thr Glu Gin Asp Ser Lys Asp Ser 
!80 185 190 



WO 97/42329 ^ FCTGB97/MMS 

-101- 

Thr Tyr Ser Leu Ser Ser Thr Leu Thr Leu Ser Lys Ala Asp Tyr Glu 
195 200 205 

Lys His Lys Val Tyr Ala Cys Glu Val Thr His Gin Gly Leu Ser Ser 
5 210 215 220 

Pro Val Thr Lys Ser Phe Asn Arg Gly Glu Cys 
225 230 235 

10 (2) INFORMATION FOR SEQ ID NO: 18: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 65 base pairs 

(B) TYPE: nucleic acid 
15 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

Ui> MOLECULE TYPE: other nucleic acid 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

25 ATGAAGTTGT GGCTGAACTG GATTTTCCTT GTAACACTTT TAAATGGAAT TCAGTGTGAG - 60 

GTGCAGCTGC AGCARTCAGG GGCAGAGCTT GTGAGGTCAG GGGCCTCAGT CAAGTTGTCC 120 

TGCACAGCTT CTGGCTTCAA CATTAAAGAC AACTATATGC ACTGGGTGAA GCAGAGGCCT 180 

30 

GAACAGGGCC TGGAGTGGAT TGCATGGATT GATCCTGAGA ATGGTGATAC TGAATATGCC 240 

CCGAAGTTCC GGGGCAAGGC CACTTTGACT GCAGACTCAT CCTCCAACAC AGCCTACCTG 300 

35 CACCTCAGCA GCCTGACATC TGAGGACACT GCCGTCTATT ACTGTCATGT CCTGATCTAT 360 



GCTGGTTATT TGGCTATGGA CTACTGGGGT CAAGGAACCT CAGTCGCCGT GAGCTCGGCT 



420 
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AGCACCAAGG GACCATCGGT CTTCCCCCTG GCCCCCTGCT CCAGGAGCAC CTCCGAGAGC 480 

ACAGCCGCCC TGGGCTGCCT GGTCAAGGAC TACTTCCCCG AACCGGTGAC GGTGTCGTGG 540 

5 AACTCAGGCG CTCTGACCAG CGGCGTGCAC ACCTTCCCGG CTGTCCTACA GTCCTCAGGA 600 

CTCTACTCCC TCAGCAGCGT CGTGACGGTG CCCTCCAGCA ACTTCGGCAC CCAGACCTAC 660 

ACCTGCAACG TAGATCACAA GCCCAGCAAC ACCAAGGTGG ACAAGACAGT TGAGCGCAAA 720 

10 

TGTTGTGTCG AGTGCCCACC GTGCCCGGCG CCACCTGTGG CCGGC 765 
(2) INFORMATION FOR SEQ ID NO: 19: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 255 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



20 



(ii) MOLECULE TYPE: protein 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 



Met Lys Leu Trp Leu Asn Trp He Phe Leu Val Thr Leu Leu Asn G 



1 5 10 



30 



15 



35 



He Gin Cys Glu Val Gin Leu Gin Gin Ser Gly Ala Glu Leu Val Arg 
20 25 30 

Ser Gly Ala Ser Val Lys Leu Ser Cys Thr Ala Ser Gly Phe Asn He 
35 40 45 

Lys Asp Asn Tyr Met His Trp Val Lys Gin Arg Pro Glu Gin Gly Leu 
50 55 60 
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Glu Trp lie Ala Trp lie Asp Pro Glu Asn Gly Asp Thr Glu Tyr Ala 

80 



65 70 75 



Pro Lys Phe Arg Gly L ys Ala Thr Leu Thr Ala Asp Ser Ser Ser 



Asn 



85 90 95 



Thr Ala Tyr Leu His Leu Ser Ser Leu Thr Ser Glu Asp Thr Ala Val 
100 105 110 

Tyr Tyr Cys His Val Leu lie Tyr Ala Gly Tyr Leu Ala Met Asp Tyr 
115 120 125 

Trp Gly Gin Gly Thr Ser Val Ala Val Ser Ser Ala Ser Thr Lys Gly 
130 135 140 

Pro Ser Val Phe Pro Leu Ala Pro Cys Ser Arg Ser Thr Ser Glu Ser 
145 150 155 160 

Thr Ala Ala Leu Gly Cys Leu Val Lys Asp Tyr Phe Pro Glu Pro Val 
165 170 175 



Thr Val Ser Trp Asn Ser Gly Ala Leu Thr Ser Gly Val Hi 



180 185 



s Thr Phe 
190 



Pro Ala Val Leu Gin Ser Ser Gly Leu Tyr Ser Leu Ser Ser Val Val 
195 200 2 o5 



Thr Val Pro Ser Ser Asn Phe Gly Thr Gin Thr Tyr Thr Cys Asn 



210 215 



Val 
220 



Asp His Lys Pro Ser Asn Thr Lys Val Asp Lys Thr Val Glu Arg Lys 



225 2 3o 235 



240 



Cys Cys Val Glu Cys Pro Pro Cys Pro Ala Pro Pro Val Ala Gly 
245 250 255 
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(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 121 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 



Ala Ser Thr Lys Gly Pro Ser Val Phe Pro Leu Ala Pro Ser Ser L 
Ser Thr Ser Gly Gly Thr Ala Ala Leu Gly Cys Leu Val Lys Asp ? 



ys 



20 25 



30 



Phe Pro Glu Pro Val Thr Val Ser Trp Asn Ser Gly Ala Leu Thr Ser 
35 40. 45 



Gly Val His Thr Phe Pro Ala Val Leu Gin Ser Ser Gly Leu Tyr Se 
50 " 60 



Leu Ser Ser Val Val Thr Val Pro Ser Ser Ser Leu Gly Thr Gin Thr 

65 7 ° 7S 

75 80 



Tyr lie Cys Asn Val Asn His Asn Pro Ser Asn Thr Lys Val Asp 



Lys 



85 



90 



95 



Lys Val Glu Pro Lys Ser Cys Asp Lys Thr His Thr Cys Pro Pro Cys 
100 105 no 



Pro Ala Pro Glu Leu Leu Gly Gly Pro 
US 120 
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(2) INFORMATION FOR SEQ ID NO: 21: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 369 base pairs 
10 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



105 



15 



Ui) MOLECULE TYPE: other nucleic acid 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 



GCCTCCACCA AGGGCCCATC GGTCTTCCCC CTGGCACCCT CCTCCAAGAG CACCTCTGGG 



60 



GGCACAGCGG CCCTGGGCTG CCTGGTCAAG GACTACTTCC CCGAACCGGT GACGGTGTCG 120 
25 TGGAACTCAG GCGCCCTGAC CAGCGGCGTG CACACCTTCC CGGCTGTCCT ACAGTCCTCA 180 



GGACTCTACT CCCTCAGCAG CGTGGTGACT GTGCCCTCCA GCAGCTTGGG CACCCAGACC 



240 



30 



TACATCTGCA ACGTGAATCA CAACCCCAGC AACACCAAGG TCGACAAGAA AGTTGAGCCC 
AAATCTTGTG ACAAGACGCA CACGTGCCCG CCGTGCCCGG CTCCGGAACT GCTGGGTGGC 



300 



360 



CCGTAATAG 



369 



35 (2) INFORMATION FOR SEQ ID NO: 22: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 116 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 



Ala Ser Thr Lys Gly Pro Ser Val Phe Pro Leu Ala Pro Cys Ser Arg 



15 



1 5 10 15 



Ser Thr Ser Glu Ser Thr Ala Ala Leu Gly Cys Leu Val Lys Asp Tyr 
20 25 30 

Phe Pro Glu Pro Val Thr Val Ser Trp Asn Ser Gly Ala Leu Thr Ser 
35 40 45 

Gly Val His Thr Phe Pro Ala Val Leu Gin Ser Ser Gly Leu Tyr Ser 

50 55 60 



Leu Ser Ser Val Val Thr Val Pro Ser Ser Asn Phe Gly Thr Gin Thr 

Tyr Thr Cys Asn Val Asp His Lys Pro. Ser Asn Thr Lys Val Asp Lys 
85 90 95 



Thr Val Glu Arg Lys Cys Cys Val Glu Cys Pro Pro Cys Pro Ala 
100 105 110 



Pro 



Pro Val Ala Gly 
115 




PCT/GB97JD1165 



- 107- 

(2) INFORMATION FOR SEQ ID NO: 23: 

U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 348 base pairs 
5 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

10 



■ % Q <xi> SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

*0 15 

p GCTAGCACCA AGGGACCATC GGTCTTCCCC CTGGCCCCCT GCTCCAGGAG CACCTCCGAG 60 

I E 5 

^ AGCACAGCCG CCCTGGGCTG CCTGGTCAAG GACTACTTCC CCGAACCGGT GACGGTGTCG 120 



5. J 



» 20 TGGAACTCAG GCGCTCTGAC CAGCGGCGTG CACACCTTCC CGGCTGTCCT 



ACAGTCCTCA 180 



W GGACTCTACT CCCTCAGCAG CGTCGTGACG GTGCCCTCCA GCAACTTCGG CACCCAGACC 240 

TACACCTGCA ACGTAGATCA CAAGCCCAGC AACACCAAGG TGGACAAGAC AGTTGAGCGC 300 

25 

AAATGTTGTG TCGAGTGCCC ACCGTGCCCG GCGCCACCTG TGGCCGGC 34 8 

(2) INFORMATION FOR SEQ ID NO: 24: 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 167 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

35 



(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

Ala Ser Thr Lys Gly Pro Ser Val Phe Pro Leu Ala Pro Cys Ser Arg 
1 5 10 15 

Ser Thr Ser Gly Gly Thr Ala Ala Leu Gly Cys Leu Val Lys Asp Tyr 
20 25 30 

Phe Pro Glu Pro Val Thr Val Ser Trp Asn Ser Gly Ala Leu Thr Ser 
35 40 45 

Gly Val His Thr Phe Pro Ala Val Leu Gin Ser Ser Gly Leu Tyr Ser 
50 55 60 



Leu Ser Ser Val Val Thr Val Pro Ser Ser Ser Leu Gly Thr Gin Th 



nr 



65 70 75 80 



Tyr Thr Cys Asn Val Asn His Lys Pro Ser Asn Thr Lys Val Asp Lys 
« 90 95 

Arg Val Glu Leu Lys Thr Pro Leu Gly Asp Thr Thr His Thr Cys Pro 
100 105 no 

Arg Cys Pro Glu Pro Lys Ser Cys Asp Thr Pro Pro Pro Cys Pro Arg 
115 120 X25 

Cys Pro Glu Pro Lys Ser Cys Asp Thr Pro Pro Pro Cys Pro Arg Cys 
130 135 14Q 

Pro Glu Pro Lys Ser Cys Asp Thr Pro Pro Pro Cys Pro Arg Cys Pro 
145 < 150 155 160 

Ala Pro Glu Leu Leu Gly Gly 
165 

(2) INFORMATION FOR SEQ ID NO: 25: 
ti) SEQUENCE CHARACTERISTICS: 



WO 91/42329 




PCT/GB97/01IL6S 



- 109- 

(A) LENGTH: 501 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

5 

(ii) MOLECULE TYPE: other nucleic acid 



10 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

GCTAGCACCA AGGGCCCATC GGTCTTCCCC CTGGCGCCCT GCTCCAGGAG CACCTCTGGG 60 
15 GGCACAGCGG CCCTGGGCTG CCTGGTCAAG GACTACTTCC CCGAACCGGT GACGGTGTCG . 120 

TGGAACTCAG GCGCCCTGAC CAGCGGCGTG CACACCTTCC CGGCTGTCCT ACAGTCCTCA 180 

GGACTCTACT CCCTCAGCAG CGTGGTGACC GTGCCCTCCA GCAGCTTGGG CACCCAGACC 240 

20 

TACACCTGCA ACGTGAATCA CAAGCCCAGC AACACCAAGG TGGACAAGAG AGTGGAGCTG 300 

AAAACCCCAC TTGGTGACAC AACTCACACG TGCCCTAGGT GTCCTGAACC TAAATCTTGT 360 

25 GACACACCTC CCCCGTGCCC ACGGTGCCCA GAGCCCAAAT CTTGCGACAC GCCCCCACCG 4 20 

TGTCCCAGAT GTCCTGAACC AAAGAGCTGT GACACTCCAC CGCCCTGCCC GAGGTGCCCA 4 80 
GCACCTGAAC TCCTGGGAGG A 



30 

(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 
35 (B) TYPE: amino acid 

(C) STRANDEDNESS: single 
ID) TOPOLOGY: linear 



501 
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<ii) MOLECULE TYPE: peptide 



3 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

Ser Ala Ser Ser Ser Val Thr Tyr Met His 
1 S 10 

0 (2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 7 amino acids 
<B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

iii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

Ser Thr Ser Asn Leu Ala Ser 

1 5 

(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 



(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



tii) MOLECULE TYPE: peptide 




- Ill - 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

5 Gin Gin Arg Ser Thr Tyr Pro Leu Thr 

1 5 

(2) INFORMATION FOR SEQ ID NO: 29: 

10 



\Q ti) SEQUENCE CHARACTERISTICS: 

J*f 15 (A) LENGTH: 5 amino acids 

C3 (B > TYPE: amino acid 



t n 



(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



l^t 20 (ii) MOLECULE TYPE: peptide 



I U 



25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 



Asp Asn Tyr Met His 
1 5 




]PCT/GB97/0aiL<S5 



30 (2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

35 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 



WO 97/42329 W FCIKMWttMS 
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5 <xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

Phe Asn lie Lys Asp Asn Tyr Met His 
1 5 

0 (2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 

5 <C> STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 



Trp He Asp Pro Glu Asn Gly Asp Thr Glu Tyr Ala Pro Lys Phe Arg 

15 



1 5 10 



Gly 



(2) INFORMATION FOR SEQ ID NO: 32: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



WO 97/42329 
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(ii) MOLECULE TYPE: peptide 



5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 



Leu He Tyr Ala Gly Tyr Leu Ala Met Asp Tyr 
1 5 10 

10 

(2) INFORMATION FOR SEQ ID NO: 33: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 
15 (B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

25 

His Val Leu He Tyr Ala Gly Tyr Leu Ala Met Asp Tyr 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 34: 

30 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
35 <D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



WO 97/42329 » ^ PCT/GE97/01165 



-114- 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 
TCGAGAGATC TAAGCTTCCG CGGGAATTCC TCGAGGAGCT CCCCGGGGGA TCCGTCGACT 60 

5 

(2) INFORMATION FOR SEQ ID NO: 35: 



10 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS ; single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: other nucleic acid 



15 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 
CTAGAGTCGA CGGATCCCCC GGGGAGCTCC TCGAGGAATT CCCGCGGAAG CTTAGATCTC 6C 



(2) INFORMATION FOR SEQ ID NO: 36: 

25 

Ei) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
30 (D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: other nucleic acid 



WO 97/42329 



115- 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 



AAGCTTCCCG GGTATTAAAG CAGAACTTG 



5 (2) INFORMATION FOR SEQ ID NO: 37: 



PCT/GB97/D1S65 



29 



10 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



15 



(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 
20 ACTAGTGGAT CCCAGACATG ATAAGATAC 



- 4> 



29 



(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

30 (ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 
GGTCTATATA AGCAGAGCTG TCTGGCTAAC TAGAGAACC 



39 



WO 97/42329 
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(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 39 base pairs 
5 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

10 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 
GGTTCTCTAG TTAGCCAGAC AGCTCTGCTT ATATAGACC 39 
15 (2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 
20 (C) STRANDEDNESS: single 

(P) TOPOLOGY: linear 

(ii) MOLECULE TYPE : other nucleic acid 

25 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 
30 GGACTTTCCT ACTTGGCAG 

19 

(2) INFORMATION FOR SEQ ID* NO: 41: 

(ii SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



WO 97/42329 



PCT/GB97/01165 



-117- 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 
GGCAACTAGA AGGCACAGTC 

10 

(2) INFORMATION FOR SEQ ID NO: 42: 

fi> SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 77 base pairs 
15 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii} MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

25 

AGCTTGCCGC CACCATGGAT TTTCAAGTGC AGATTTTCAG CTTCCTGCTA ATCAGTGCTT 
CAGTCATAAT GTCCCGC 
30 (2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 71 base pairs 

(B) TYPE: nucleic acid 
35 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



PCT/GB97/01165 

-118- 



5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 

GGGACATTAT GACTGAAGCA CTGATTAGCA GGAAGCTGAA AATCTGCACT TGAAAATCCA 
TGGTGGCGGC A 

10 

(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 61 base pairs 
15 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: other nucleic acid 

20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

25 

AGCTTGCCGC CACCATGAAG TTGTGGCTGA ACTGGATTTT CCTTGTAACA CTTTTAAATG 



30 (2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 61 base pairs 

(B) TYPE: nucleic acid 
35 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: other nucleic 





WO 97/42329 

PCT/GB97/01165 
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5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

AATTCCATTT AAAAGTGTTA CAAGGAAAAT CCAGTTCAGC CACAACTTCA TGGTGGCGGC 



A 

10 



15 



is c 



(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 357 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
CD) TOPOLOGY: linear 

<ii) MOLECULE TYPE : other nucleic acid 



a 20 



25 



60 



61 



60 



120 



txi) SEQUENCE DESCRIPTION : SEQ ID NO: 46: 
AAGCTTCTCG AGATCAAACG GACTGTGGCT GCACCATCTG TCTTCATCTT CCCGCCATCT 
GATGAGCAGT TGAAATCTGG AACTGCCTCT GTTGTGTGCC TGCTGAATAA CTTCTATCCC 
30 AGAGAGGCCA AAGTACAGTG GAAGGTGGAT AACGCCCTCC AATCGGGTAA CTCCCAGGAG 180 
AGTGTCACAG AGCAGGACAG CAAGGACAGC ACCTACAGCC TCAGCAGCAC CCTGACGCTG 
AGCAAAGCAG ACTACGAGAA AC ACAAAGT C TACGCCTGCG AAGTCACCCA TCAGGGCCTG 



240 



300 



35 

AGTTCGCCCG TCACAAAGAG CTTCAACAGG GGAGAGTGTT AATAGCCCGG GACTAGT 357 



(2) INFORMATION FOR SEQ ID NO: 47: 



WO 97/42329 



- 120 - 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 381 base pairs 

(B) TYPE: nucleic acid 

5 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

10 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 
15 GGAAGCTTGA GCTCGGCTAG CACCAAGGGA CCATCGGTCT TCCCCCTGGC CCCCTGCTCC 
AGGAGCACCT CCGAGAGCAC AGCCGCCCTG GGCTGCCTGG TCAAGGACTA CTTCCCCGAA 
CCGGTGACGG TGTCGTGGAA CTCAGGCGCT CTGACCAGCG GCGTGCACAC CTTCCCGGCT 

20 

GTCCTACAGT CCTCAGGACT CTACTCCCTC AGCAGCGTCG TGACGGTGCC CTCCAGCAAC 
TTCGGCACCC AGACCTACAC CTGCAACGTA GAT CACAAGC CCAGCAACAC CAAGGTGGAC 
25 AAGACAGTTG AGCGCAAATG TTGTGTCGAG TGCCCACCGT GCCCGGCGCC ACCTGTGGCC 



GGCTAATAGC CCGGGACTAG T 
(2) INFORMATION FOR SEQ ID NO: 48: 

30 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 342 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
35 (D) TOPOLOGY: linear 



381 



(ii) MOLECULE TYPE: other nucleic acid 



WO 97/42329 



0 




PCT/GB97/01165 



121 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 



AAGCTTTCCC GCGGCGACAT CCAGATGACC CAGAGCCCAA GCAGCCTGAG CGCTAGCGTG €0 
GGTGACAGAG TGACCATCAC GTGTAGTGCC AGCTCAAGTG TAACTTACAT GCACTGGTAC 120 
CAGCAGAAGC CAGGTAAGGC TCCAAAGCTG CTGATCTACA GCACATCCAA CCTGGCTTCT 180 



10 GGTGTGCCAA GCAGATTCTC CGGAAGCGGT AGCGGCACCG ACTACACCTT 



CACCATCAGC 240 



*B 15 



n 

m 



20 



AGCCTCCAGC CAGAGGATAT CGCCACCTAC TACTGCCAGC AGAGGAGTAC TTACCCGCTC 
ACGTTCGGCC AAGGGACCAA GCTCGAGATC AAACGGACTA GT 
(2) INFORMATION FOR SEQ ID NO : 49: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 321 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



300 



342 



25 



(ii) MOLECULE TYPE: other nucleic acid 



30 



(Xi) SEQUENCE DESCRIPTION : SEQ ID NO: 49: 



GACATCCAGA TGACCCAGAG CCCAAGCAGC CTGAGCGCTA GCGTGGGTGA CAGAGTGACC 



60 



ATCACGTGTA GTGCCAGCTC AAGTGTAACT TACATGCACT GGTACCAGCA GAAGCCAGGT 



120 



35 AAGGCTCCAA AGCTGCTGAT CTACAGCACA TCCAACCTGG 



CTTCTGGTGT G CCAAGCAG A 180 



TTCTCCGGAA GCGGTAGCGG CACCGACTAC ACCTTCACCA TCAGCAGCCT CCAG CCAGAG 



240 




WO 97/42329 
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GATATCGCCA CCTACTACTG CCAGCAGAGG AGTACTTACC CGCTCACGTT CGGCCAAGGG 300 



ACCAAGCTCG AGATCAAACG G 
5 (2) INFORMATION FOR SEQ ID NO: 50: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 107 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



321 



10 



U 15 



AS 



25 



30 



35 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 



Jjj 20 Asp lie Gin Met Thr Gin Ser Pro Ser Ser Leu Ser Ala Ser Val Gly 

\'fl 1 5 



10 15 



Asp Arg Val Thr He Thr Cys Ser Ala Ser Ser Ser Val Thr Tyr Met 
20 25 30 

His Trp Tyr Gin Gin Lys Pro Gly Lys Ala Pro Lys Leu Leu lie Tyr 



35 40 



45 



Ser Thr Ser Asn Leu Ala Ser Gly Val Pro Ser Arg Phe Ser Gly Ser 



50 



55 



60 



Gly ser Gly Thr Asp Tyr .Thr Phe Thr lie Ser Ser Leu Gin Pro Glu 
" 70 75 B0 



Asp He Ala Thr Tyr Tyr Cys Gin Gin Arg Ser Thr Tyr Pro Leu 



Thr 



85 



90 



95 




10 



WO 97/42329 
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Phe Gly Gin Gly Thr Lys Leu Glu lie Lys Arg 
100 105 

(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 705 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 




PCT/GB97/01165 



H 15 



(ii) MOLECULE TYPE: other nucleic acid 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 
ATGGATTTTC AAGTGCAGAT TTTCAGCTTC CTGCTAATCA GTGCTTCAGT CATAATGTCC 
CGCGGCGACA TCCAGATGAC CCAGAGCCCA AGCAGCCTGA GCGCTAGCGT GGGTGACAGA 
20 GTGACCATCA CGTGTAGTGC CAGCTCAAGT GTAACTTACA TGCACTGGTA CCAGCAGAAG 
CCAGGTAAGG CTCCAAAGCT GCTGATCTAC AGCACATCCA ACCTGGCTTC TGGTGTGCCA 
AGCAGATTCT CCGGAAGCGG TAG CGGCACC GACTACACCT TCAC CATCAG CAGCCTCCAG 

25 

CCAGAGGATA TCGCCACCTA CTACTGCCAG CAGAGGAGTA CTTACCCGCT CACGTTCGGC 
CAAGGGACCA AGCTCGAGAT CAAACGGACT GTGGCTGCAC CATCTGTCTT CATCTTCCCG 
30 CCATCTGATG AG CAGTTGAA ATCTGGAACT GCCTCTGTTG TGTGCCTGCT GAATAACTTC 
TATCCCAGAG AGGCCAAAGT ACAGTGGAAG GTGGATAACG CCCTCCAATC GGGTAACTCC 



35 



CAGGAGAGTG TCACAGAGCA GGACAGCAAG GACAGCACCT ACAGCCTCAG CAGCACCCTG 



ACGCTGAGCA AAGCAGACTA CGAGAAACAC AAAGTCTACG CCTGCGAAGT CACC CATCAG 



GGCCTGAGTT CGCCCGTCAC AAAGAGCTTC AACAGGGGAG AGTGT 



60 



120 



ieo 



240 



300 



360 



42.Q 



460 



540 



600 



660 



705 
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(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 235 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 

Met Asp Phe Gin Val Gin He Phe Ser Phe Leu Leu He Ser Ala Ser 
1 5 10 15 



Val He Met Ser Arg Gly Asp He Gin Met Thr Gin Ser Pro Ser Ser 

30 



20 25 



Leu Ser Ala Ser Val Gly Asp Arg Val Thr lie Thr Cys Ser Ala Ser 
35 40 45 



Ser Ser Val Thr Tyr Met His Trp Tyr Gin Gin 

60 



Lys Pro Gly Lys Ala 

5 ° 55 



Pro Lys Leu Leu He Tyr Ser Thr Ser Asn Leu Ala Ser Gly Val Pro 
65 70 



75 



80 



Ser Arg Phe Ser Gly Ser Gly Ser Gly Thr Asp Tyr Thr Phe Thr He 
85 90 



95 



Ser Ser Leu Gin Pro Glu Asp He Ala Thr Tyr Tyr Cys Gin Gin Arg 



100 105 



110 



WO 97/42329 
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Ser Thr Tyr Pro Leu Thr Phe Gly Gin Gly Thr Lys 



Leu Glu lie Lys 



115 120 a25 



Arg Thr Val Ala Ala Pro Ser Val Phe lie Phe Pro Pro Ser Asp Glu 
130 135 140 

Gin Leu Lys Ser Gly Thr Ala Ser Val Val Cys Leu Leu Asn Asn Phe 
" 5 150 155 160 

Tyr Pro Arg Glu Ala Lys Val Gin Trp Lys Val Asp Asn Ala Leu Gin 
165 "0 



Ser Gly Asn Ser Gin Glu Ser Val Thr Glu Gin Asp Ser Lys Asp 



Ser 



180 



1S5 



190 



Thr Tyr Ser Leu Ser Ser Thr Leu Thr Leu Ser Lys Ala Asp Tyr Glu 



n 195 200 205 



Lys His Lys Val Tyr Ala Cys Glu Val Thr His Gin Gly Leu Ser Sex 
210 215 



220 

TO 

Q Pro Val Thr L Y S Se ^ Phe Asn Arg Gly Glu Cys 

225 230 



235 



25 (2) INFORMATION FOR SEQ. ID NO: 



53 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 85 base pairs 

(B) TYPE: nucleic acid 
30 <C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



35 



WO 97/42329 _ PCT/GB97/01165 

- 126 - 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 53: 



10 

TAGTAAGAAC CAGTTCAGCC TGAGACTCAG CAGCGTGACA GCCGCCGACA CCGCGGTCTA 



20 



25 



(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 360 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: other nucleic acid 



SO 



GAAGCTTGGA ATTCAGTGTG AGGTGCAGCT GCAGCAGAGC GGTCCAGGTC TCGTACGGCC 
5 TAGCCAGACC CTGAGCCTCA CGTGCACCGC ATCTGGCTTC AACATTAAGG ACAATTACAT 120 
GCACTGGGTG AGACAGCCAC CTGGACGAGG CCTTGAGTGG ATTGGATGGA TTGACCCTGA 
GAATGGTGAC ACTGAGTACG CACCTAAGTT TCGCGGCCGC GTGACAATGC TGGCAGACAC 



1B0 



240 



300 



TTATTGTCAC GTCCTGATAT ACGCCGGGTA TCTGGCAATG GACTACTGGG GCCAAGGGAC 360 
15 CCTCGTCACC GTGAGCTCGA CTAGT 



365 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 

30 GAGGTGCAGC TGCAGCAGAG CGGTCCAGGT CTCGTACGGC CTAGCCAGAC CCTGAGCCTC 
ACGTGCACCG CATCTGGCTT CAACATTAAG GACAATTACA TGCACTGGGT GAGACAGCCA 

^ C CTGGACG AG GCCTTGAGTG GATTGGATGG ATTGACCCTG AGAATGGTGA CACTGAGTAC 
GCACCTAAGT TTCGCGGCCG CGTGACAATG CTGGCAGACA CTAGTAAGAA CCAGTTCAGC 
CTGAGACTCA GCAGCGTGAC AGCCGCCGAC ACCG CGGTCT ATTATTGTCA CGTCCTGATA 




WO 97/42329 _ W PCT/GB97/01165 
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TACGCCGGGT ATCTGGCAAT GGACTACTGG GGCCAAGGGA CCCTCGTCAC CGTGAGCTCG 360 



5 (2) INFORMATION FOR SEQ ID NO: 5S : 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 120 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



10 



U 15 



H 20 



25 



30 



(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 



Glu Val Gin Leu Gin Gin Ser Gly Pro Gly Leu Val Arg Pro Ser Gin 
1 5 10 15 



Thr Leu Ser Leu Thr Cys Thr Ala Ser Gly Phe Asn lie Lys Asp 
20 25 30 



Asn 



Tyr Met His Trp Val Arg Gin Pro Pro Gly Arg Gly Leu Glu Trp lie 
35 40 45 

Gly Trp lie Asp Pro Glu Asn Gly Asp Thr Glu Tyr Ala Pro Lys Phe 
50 " 60 

Arg Gly Arg Val Thr Met Leu Ala Asp Thr Ser Lys Asn Gin Phe Ser 

75 



35 



Leu Arg Leu Ser Ser Val 
85 



Ala Ala Asp Thr Ala Val Tyr Tyr Cys 
90 9s 



WO 97/42329 
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His VI Leu lie Tyr Ala Gly Tyr Leu Ala Met Asp Tyr Trp ciy Gin 



100 105 



Gly Thr Leu Val Thr Val Ser Ser 
5 115 120 



(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS : 
10 (A) LENGTH: 765 base pairs 

<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



15 



(ii) MOLECULE TYPE: other nucleic acid 



110 



20 (xi) SEQUENCE DESCRIPTION : SEQ ID NO: 56: 

ATGAAGTTGT GGCTGAACTG GATTTTCCTT rnnr.^ 

TTCCTT GTAACACTTT TAAATGGAAT TCAGTGTGAG 



60 



^ GTGCAGCTGC AGCAGAGCGG TCCAGGTCTC GTACGGCCTA GCCAGACCCT GA5CCTCACG la „ 

TGCACCGCAT CTGGCTTCAA CATTAAGGAC AATTACATG C ACTGGGTGAG ACAGCCACCT 1B0 

GGACGAGGCC TTGAGTGGAT TGGATGGATT GACCCTGAGA ATGGTGACAC TGAGTACGCA 240 

30 CCTAAGTTTC GCGGCCGCGT GACAATG CTG GCAGACACTA GTAAGAACCA GTTCAGCCTG 300 

AGACTCAGCA GCGTGACAGC CGCCGACACC GCGGTCTATT ATTGTCACGT CCTGATATAC 360 

^ GCCGGGTATC TGG CAATGGA CTACTGGGGC CAAGGGACCC TCGTCACCGT GAGCTCGGCT « 20 

AGCACCAAGG GACCATCGGT CTTCCCCCTG GCCCCCTGCT CCAGGAGCAC CTCCGAGAGC « 8 0 
ACAGCCGCCC TGGG CTG CCT GGTCAAGGAC TACTTCCCCG AACCGGTGAC GGTGTCGTGG " 540 



WO 97/42329 
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AACTCAGGCG CTCTGACCAG CGGCGTGCAC ACCTTCCCGG CTGTCCTACA GTCCTCAGGA 
CTCTACTCCC TCAGCAGCGT CGTGACGGTG CCCTCCAGCA ACTTCGGCAC CCAGACCTAC 

5 

ACCTGCAACG TAGATCACAA GCCCAGCAAC ACCAAGGTGG ACAAGACAGT TGAGCGCAAA 
TGTTGTGTCG AGTGCCCACC GTGCCCGGCG CCACCTGTGG CCGGC 
0 (2) INFORMATION FOR SEQ ID NO: 57 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 255 amino acids 
<B) TYPE: amino acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

Met Lys Leu Trp Leu Asn Trp He Phe Leu Val Thr Leu Leu Asn Gly 



10 



15 



He Gin Cys Glu Val Gin Leu Gin Gin Ser Gly Pro Gly Leu Val Arg 
20 " 30 

Pro Ser Gin Thr Leu Ser Leu Thr Cys Thr Ala Ser Gly Phe Asn lie 



35 



40 



45 



Lys Asp Asn Tyr Met His Trp Val Arg Gin Pro Pro Gly Arg Gly Leu 



55 60 



Glu Trp He Gly Trp He Asp Pro Glu Asn 

75 B0 



Gly Asp Thr Glu Tyr Ala 

65 70 
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Pro Lys Phe Arg Gly Arg Val Thr M t Leu Ala Asp Thr Ser Lys Asn 
8S 90 95 

Gin Phe Ser Leu Arg Leu Ser Ser Val Thr Ala Ala Asp Thr Ala Val 
100 105 110 

Tyr Tyr Cys His Val Leu lie Tyr Ala Gly Tyr Leu Ala Met Asp Tyr 
115 120 12S 

Trp Gly Gin Gly Thr Leu Val Thr Val Ser Ser Ala Ser Thr Lys Gly 
130 "5 140 

Pro Ser Val Phe Pro Leu Ala Pro Cys Ser Arg Ser Thr Ser Glu Ser 
145 150 155 160 



Thr Ala Ala Leu Gly Cys Leu Val Lys Asp Tyr Phe Pro Glu Pro Val 

175 



165 170 



Thr Val Ser Trp Asn Ser Gly Ala Leu Thr Ser Gly Val His Thr Phe 
180 18S 



190 



Pro Ala Val Leu Gin Ser Ser Gly Leu Tyr Ser Leu Ser Ser Val Val 
«S 200 



205 



Thr Val Pro Ser Ser Asn Phe Gly Thr Gin Thr Tyr Thr Cys Asn 



Val 



210 



215 



220 



Asp His Lys Pro Ser Asn Thr Lys Val Asp Lys Thr Val Glu Arg Lys 
2 " 230 235 



Cys Cys Val Glu Cys Pro Pro Cys Pro Ala Pro Pro Val Ala 

255 



Gly. 

2« 250 



(2) INFORMATION FOR SEQ ID NO: 58: 



(il SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 40 base pairs 



WO 97/42329 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
CD) TOPOLOGY: linear 

5 (ii) MOLECULE TYPE: other nucleic acid 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 

GGCGACATCC AGCTGACCCA GAGCCCAAGC AGCCTGAGCG 

10 

(2) INFORMATION FOR SEQ ID NO: 59: 



15 



<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 6 base pairs 

(B) TYPE; nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE : other nucleic acid 



40 



20 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 

25 

CTAGCGCTCA GGCTGCTTGG GCTCTGGGTC AG CTGGATGT CGCCGC 
(2) INFORMATION FOR SEQ ID NO: 60: 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 321 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

35 

(ii) MOLECULE TYPE: other nucleic acid 



46 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6$: 
GACATCCAGC TGACCCAGAG CCCAAGCAGC CTGAGCGCTA GCGTGGGTGA CAGAGTGACC 60 



5 ATCACGTGTA GTGCCAGCTC AAGTGTAACT TACATG CACT GGTACCAGCA 



GAAGCCAGGT 120 



10 



AAGGCTCCAA AGCTG CTGAT CTACAG CACA TCCAACCTGG CTTCTGGTGT GCCAAGCAGA 180 
TTCTCCGGAA GCGGTAGCGG CACCGACTAC ACCTTCACCA TCAGCAGCCT CCAGCCAGAG 240 
GATATCGCCA CCTACTACTG CCAGCAGAGG AGTACTTACC CGCTCACGTT CGGCCAAGGG 300 



ACCAAG CTCG AGATCAAACG G 



321 



15 (2) INFORMATION FOR SEQ ID NO: 61: 



20 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 107 amino acids 
<B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



25 



(ii) MOLECULE TYPE: peptide 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 



30 Asp lie Gin 



Leu Thr Gin Ser Pro Ser Ser Leu Ser Ala Ser Val Gly 
1 5 



10 



15 



35 



Asp Arg Val Thr lie Thr Cys Ser Ala Ser Ser Ser Val Thr Tyr Met 
20 25 



30 



His Trp Tyr Gin Gin Lys Pro Gly Lys Ala Pro Lys Leu Leu He Tyr 

35 *n 
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Ser Thr Ser Asn Leu Ala Ser Gly Val Pro Ser Arg Phe Ser Gly Ser 
50 55 60 



Gly Ser Gly Thr Asp Tyr Thr Phe Thr lie Ser Ser Leu Gin Pro Glu 
5 65 70 75 80 



Asp He Ala Thr Tyr Tyr Cys Gin Gin Arg Ser Thr Tyr Pro Leu Thr 
85 90 95 

10 Phe Gly Gin Gly Thr Lys Leu Glu He Lys Arg 

100 105 

(2) INFORMATION FOR SEQ ID NO: 62: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 



20 



(ii) MOLECULE TYPE: other nucleic acid 



25 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 
GGCCAGATCG TGCTGACCCA GAGCCCAAGC AG CCTGAGCG 
30 (2) INFORMATION FOR SEQ ID NO: 63: 



40 



<i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 4 6 base pairs 
IB) TYPE: nucleic acid 
35 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



lii) MOLECULE TYPE: other nucleic acid 
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5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: €3: 

CTAGCGCTCA GGCTGCTTGG GCTCTGGGTC AGCACGATCT GGCCGC 
(2) INFORMATION FOR SEQ ID NO: 64: 

10 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 321 base pairs 
m <B) TYPE: nucleic acid 

SL-J 

=Jp] (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



\Q 15 



i r! 



(ii) MOLECULE TYPE: other nucleic acid 



P 20 

fu 

hi 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 
CAGATCGTGC TGACCCAGAG CCCAAGCAGC CTGAGCGCTA GCGTGGGTGA CAGAGTGACC 



35 

(2) INFORMATION FOR SEQ ID NO: 65: 



46 



60 



25 

ATCACGTGTA GTGCCAGCTC AAGTGTAACT TACATGCACT GGTACCAGCA GAAGCCAGGT 120 
AAGGCTCCAA AGCTGCTGAT CTACAGCACA TCCAACCTGG CTTCTGGTGT GCCAAGCAGA 
30 TTCTCCGGAA GCGGTAGCGG CACCGACTAC ACCTTCACCA TCAGCAGCCT CCAGCCAGAG 
GATATCGCCA CCTACTACTG . CCAGCAGAGG AGT ACTTAC C CG CTCACGTT CGGCCAAGGG 

ACCAAGCTCG AGATCAAACG G 



240 



321 



(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 107 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 



Gin lie Val Leu Thr Gin Ser Pro Ser Ser Leu Ser Ala Ser Val 
1 5 10 15 



Gly 



Asp Arg Val Thr lie Thr Cys Ser Ala Ser Ser Ser Val Thr Tyr Met 
20 25 30 



His Trp Tyr Gin Gin Lys Pro Gly Lys Ala Pro Lys Leu Leu lie Tyr 
35 40 45 

Ser Thr Ser Asn Leu Ala Ser Gly Val Pro Ser Arg Phe Ser Gly Ser 
50 55 60 

Gly Ser Gly Thr Asp Tyr Thr Phe Thr lie Ser Ser Leu Gin Pro Glu 
65 70 75 so 

Asp lie Ala Thr Tyr Tyr Cys Gin Gin Arg Ser Thr Tyr Pro Leu Thr 
85 90 95 

Phe Gly Gin Gly Thr Lys Leu Glu He Lys Arg 
100 105 

(2) INFORMATION FOR SEQ ID NO: 66: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 22 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 

CGTATTAGTC ATCGCTATTA CC 

22 

(2) INFORMATION FOR SEQ ID NO: 67 : 

15 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
20 <D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 

GTTGGATGTG CTGTAGATCC ACAG CTTTGG AGCCTTACC 
(2) INFORMATION FOR SEQ ID NO: 68: 

30 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
35 (D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: other nucleic acid 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 

5 

TCCGTTTGAT CTCGAGCTTG G 
(2) INFORMATION FOR SEQ ID NO: 69: 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

15 

(ii) MOLECULE TYPE: other nucleic acid 



20 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: €9: 
GGTAAGGCTC CAAAGCTGTG GATCTACAGC ACATCCAAC 
25 (2) INFORMATION FOR SEQ ID NO: 70: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 21 base pairs 

(B) TYPE: nucleic acid 
30 (C) STRANDEDNESS: single 

<D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: other nucleic acid 

35 



WO 97/42329 



60 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 
GACATCCAGA TGACCCAGAG CCCAAGCAGC CTGAGCGCTA GCGTGGGTGA CAGAGTGACC 
5 ATCACGTGTA GTGCCAG CTC AAGTGTAACT TACATGCACT GGTACCAGCA GAAGCCAGGT 120 
AAGGCTCCAA AG CTGTGGAT CTACAGCACA TCCAACCTGG CTTCTGGTGT GCCAAGCAGA 
TTCTCCGGAA GCGGTAGCGG CACCGACTAC ACCTTCACCA TCAGCAGCCT CCAGCCAGAG 



180 



10 

GATATCGCCA CCTACTACTG CCAGCAGAGG AGTACTTACC CGCTCACGTT CGGCCAAGGG 

ACCAAGCTCG AGATCAAACG G 
15 <2) INFORMATION FOR SEQ ID NO: 71; 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 107 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



240 



300 



321 



20 



<ii) MOLECULE TYPE : protein 



25 



30 



35 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 71: 

Asp He Gin Met Thr Gin Ser Pro Ser Ser Leu Ser Ala Ser Val Gly 

1 

Asp Arg Val Thr lie Thr Cys Ser Ala Ser Ser Ser Val Thr Tyr Met 

2S 

His Trp Tyr Gin Gin Lys Pro Gly Lys Ala Pro Lys Leu Trp lie Tyr 
35 40 45 
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Ser Thr Ser Asn Leu Ala Ser Gly Val Pro Ser Arg Phe Ser Gly Ser 



50 



55 60 



Gly Ser Gly Thr Asp Tyr Thr Phe Thr lie Ser Ser Leu Gin Pro Glu 
5 65 70 75 80 

Asp He Ala Thr Tyr Tyr Cys Gin Gin Arg Ser Thr Tyr Pro Leu Thr 
85 90 95 

10 Phe Gly Gin Gly Thr Lye Leu Glu He Lys Arg 



in 



100 

(2) INFORMATION FOR SEQ ID NO: 72: 

15 U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



105 



20 



(ii) MOLECULE TYPE: other nucleic acid 



Sal 

f ^ 



25 

<"xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 

CCTTGAGTGG ATTGCATGGA TTGACCCTGA GAATGGTGAC ACTGAGTACG CACCTAAGTT 60 
30 TCGC 



64 



(2) INFORMATION FOR SEQ ID NO: 73: 
(i) SEQUENCE CHARACTERISTICS: 



35 



(A) LENGTH: 68 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: other nucleic acid 



U 20 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 

10 

GGCCGCGAAA CTTAGGTGCG TACTCAGTGT CACCATTCTC AGGGTCAATC CATGCAATCC 
ACTCAAGG 



15 (2) INFORMATION FOR SEQ ID NO: 74: 



<i> SEQUENCE CHARACTERISTICS: 

(AJ LENGTH: 3 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: other nucleic acid 



25 



60 



66 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74 : 
30 GAGGTGCAGC TGCAGCAGAG CGGTCCAGGT CTCGTACGGC CTAG CCAG AC CCTGAGCCTC 60 
ACGTGCACCG CATCTGGCTT CAACATTAAG GACAATTACA TGCACTGGGT GAGACAGCCA 
CCTGGACGAG GCCTTGAGTG GATTGCATGG ATTGACCCTG AGAATGGTGA CACTGAGTAC 180 

35 

GCACCTAAGT TTCGCGGCCG CGTGACAATG CTGGCAGACA CTAGTAAGAA CCAGTTCAGC 
CTGAGACTCA GCAGCGTGAC AGCCGCCGAC ACCGCGGTCT ATTATTGTCA CGTCCTGATA 300 




5 
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TACGCCGGGT ATCTGGCAAT GGACTACTGG GGCCAAGGGA CCCTCGTCAC CGTGAGCTCG 360 
(2) INFORMATION FOR SEQ ID NO: 75; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 120 amino acids 

(B) TYPE: amino acid 



(C) STRANDEDNESS : single 
10 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



15 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 



Glu Val Gin Leu Gin Gin Ser Gly Pro Gly Leu Val Arg Pro Ser Gin 
20 5 10 1S 



Thr Leu Ser Leu Thr Cys Thr Ala Ser Gly Phe Asn lie Lys Asp Asn 
20 25 30 

25 Tyr Met His Trp Val Arg Gin Pro Pro Gly Arg Gly Leu Glu Trp He 

35 40 45 

Ala Trp He Asp Pro Glu Asn Gly Asp Thr Glu Tyr Ala Pro Lys Phe 
50 55 60 



30 



35 



er 



Arg Gly Arg Val Thr Met Leu Ala Asp Thr Ser Lys Asn Gin Phe S 
65 70 75 bo 



Leu Arg Leu Ser Ser Val Thr Ala Ala Asp Thr Ala Val Tyr Tyr Cys 
85 90 95 

His Val Leu lie Tyr Ala Gly Tyr Leu Ala Met Asp Tyr Trp Gly Gin 
100 105 HO 
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Gly Thr Leu Val Thr Val Ser Ser 
115 120 

5 (2) INFORMATION FOR SEQ ID NO: 76: 



142 



10 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 80 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



15 



(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 



20 GGCCGCGTGA CAATGCTGGC AGACTCAAGT AAGAACCAGG CCAGCCTGAG 



ACTCAGCAGC 



60 



GTGACAGCCG CCGACACCGC 



80 



25 



30 



(2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 74 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
CD) TOPOLOGY: linear 



(iii MOLECULE TYPE: other nucleic acid 



35 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 



GGTGTCGGCG GCTGTCACGC TGCTGAGTCT CAGGCTGGCC TGGTTCTTAC TTGAGTCTGC 6C 



5 CAGCATTGTC ACGC 



74 



(2) INFORMATION FOR SEQ ID NO: 78: 



(i) SEQUENCE CHARACTERISTICS: 



10 



15 



(A) LENGTH: 360 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



20 



(ii) MOLECULE TYPE: other nucleic acid 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 



25 GAGGTGCAGC TGCAGCAGAG CGGTCCAGGT CTCGTACGGC CTAGCCAGAC 



CCTGAGCCTC 60 



ACGTGCACCG CATCTGGCTT CAACATTAAG GACAATTACA TGCACTGGGT GAGACAGCCA 120 



CCTGGACGAG GCCTTGAGTG GATTGGATGG ATTGACCCTG AGAATGGTGA CACTGAGTAC 180 



30 



GCACCTAAGT TTCGCGGCCG CGTGACAATG CTGG CAG ACT CAAGTAAGAA CCAGGCCAGC 240 



CTGAGACTCA GCAGCGTGAC AGCCGCCGAC ACCGCGGTCT ATTATTGTCA CGTCCTGATA 300 



35 TACGCCGGGT ATCTGGCAAT GGACTACTGG GG CCAAGGG A CCCTCGTCAC 



CGTGAGCTCG 360 
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(2) INFORMATION FOR SEQ ID NO: 79: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 120 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 



Giu Val Gin Leu Gin Gin Ser Gly Pro Gly Leu Val Arg. Pro Ser Gin 
1 5 10 15 



Thr Leu Ser Leu Thr Cys Thr Ala Ser Gly Phe Asn lie Lys Asp Asn 
20 25 30 



Tyr Met His Trp Val Arg Gin Pro Pro Gly Arg Gly Leu Glu Trp lit 

45 



35 40 



Gly Trp lie Asp Pro Glu Asn Gly Asp Thr Glu Tyr Ala Pro Lys Phe 
50 55 60 

Arg Gly Arg Val Thr Met Leu Ala Asp Ser Ser Lys Asn Gin Ala Ser 
65 70 75 eo 

Leu Arg Leu Ser Ser Val Thr Ala Ala Asp Thr Ala Val Tyr Tyr Cys 
85 90 95 

His Val Leu He Tyr Ala Gly Tyr Leu Ala Met Asp Tyr Trp Gly Gin 
100 105 110 



Gly Thr Leu Val Thr Val Ser Ser 
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<2) INFORMATION FOR SEQ ID NO; 80: 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 360 base pairs 

<B> TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

10 (ii) MOLECULE TYPE: other nucleic acid 



15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 

GAGGTGCAGC TGCAGCAGAG CGGTCCAGGT CTCGTACGGC CTAG CCAG AC CCTGAGCCTC 
ACGTG CACCG CATCTGGCTT CAACATTAAG GACAATTACA TGCACTGGGT GAGACAG CCA 

20 

CCTGGACGAG GCCTTGAGTG GATTGCATGG ATTGACCCTG AGAATGGTGA CACTGAGTAC 
GCACCTAAGT TTCGCGGCCG CGTGACAATG CTGGCAGACT CAAGTAAGAA CCAGGCCAGC 
25 CTGAGACTCA GCAGCGTGAC AGCCGCCGAC ACCGCGGTCT ATTATTGTCA CGTCCTGATA 
TACGCCGGGT ATCTGGCAAT GGACTACTGG GGCCAAGGGA CCCTCGTCAC CGTGAGCTCG 

30 (2) INFORMATION FOR SEQ ID NO : 81: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 120 amino acids 

(B) TYPE: amino acid 

35 <C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 



60 
120 
180 
240 
300 
360 



(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 

Glu Val Gin Leu Gin Gin Ser Gly Pro Gly Leu Val Arg Pro Ser Gin 
5 1 s 10 1S 

Thr Leu Ser Leu Thr Cys Thr Ala Ser Gly Phe Asn He Lys Asp Asn 
20 25 30 

10 xyr Met His Trp Val Arg Gin Pro Pro Gly Arg Gly Leu Glu Trp He 

35 40 45 



15 



20 



25 



Ala Trp He Asp Pro Glu Asn Gly Asp Thr Glu Tyr Ala Pro Lys Phe 
50 55 go 



Arg Gly Arg Val Thr Met Leu Ala Asp Ser Ser Lys Asn Gin Ala Ser 

75 80 



Leu Arg Leu Ser Ser Val Thr Ala Ala Asp Thr Ala Val Tyr Tyr Cys 
85 90 95 



His Val Leu lie Tyr Ala Gly Tyr Leu Ala Met Asp Tyr Trp Gly Gin 
100 105 110 

Gly Thr Leu Val Thr Val Ser Ser 

H5 120 



(2) INFORMATION FOR SEQ ID NO: B2 : 

30 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 80 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
35 (D) TOPOLOGY: linear 



(ii> MOLECULE TYPE: other nucleic acid 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 



GGCCGCGCCA CAATGCTGGC AGACACTAGT AAGAACCAGT TCAGCCTGAG ACTCAGCAGC 
GTGACAGCCG CCGACACCGC 
10 (2) INFORMATION FOR SEQ ID NO: B3 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 74 base pairs 

(B) TYPE: nucleic acid 



(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: other nucleic acid 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 
GGTGTCGGCG GCTGTCACGC TGCTGAGTCT CAGGCTGAAC TGG TTCTTAC TAGTGTCTGC 
CAG CATTGTG GCGC 

(2) INFORMATION FOR SEQ ID NO: 84: 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 360 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



5 



15 



(C) STRANDEDNESS: single 



20 



30 



(ii) MOLECULE TYPE: other nucleic acid 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 
GAGGTGCAGC TGCAGCAGAG CGGTCCAGGT CTCGTACGGC CTAGCCAGAC CCTGAGCCTC 60 

5 

ACGTGCACCG CATCTGGCTT CAACATTAAG GACAATTACA TGCACTGGGT GAGACAG CCA 120 
CCTGGACGAG GCCTTGAGTG GATTGGATGG ATTGACCCTG AGAATGGTGA CACTGAGTAC 180 
10 GCACCTAAGT TTCGCGGCCG CGCCACAATG CTGGCAGACA CTAGTAAGAA CCAGTTCAGC 240 
CTGAGACTCA GCAGCGTGAC AGCCGCCGAC ACCGCGGTCT ATTATTGTCA CGTCCTGATA 300 
TACGCCGGGT ATCTGGCAAT GGACTACTGG GGCCAAGGGA CCCTCGTCAC CGTGAGCTCG 360 

15 

(2) INFORMATION FOR SEQ ID NO: 85: 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 120 amino acids 

(B) TYPE: amino acid 
<C> STRANDEDNESS : single 
(D) TOPOLOGY: linear 

25 (ii) MOLECULE TYPE: protein 



30 (xi) SEQUENCE DESCRIPTION : SEQ ID NO: 85: 

Glu Va-1 Gin Leu Gin Gin Ser Gly Pro Gly Leu Val Arg Pro Ser Gin 
1 5 10 15 

35 Thr Leu Ser Leu Thr Cys Thr Ala Ser Gly Phe Asn lie Lys Asp Asn 
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Tyr Met His Trp Val Arg Gin Pro Pro Gly Arg Gly Leu Glu Trp lie 
35 40 45 



Gly Trp He Asp Pro Glu Asn Gly Asp Thr Glu Tyr Ala Pro Lys Phe 
5 50 55 €0 



Arg Gly Arg Ala Thr Met Leu Ala Asp Thr Ser Lys Asn Gin Phe Ser 
65 ™ 75 80 

10 Leu Arg Leu Ser Ser Val Thr Ala Ala Asp Thr Ala Val Tyr Tyr Cys 

85 90 95 



His Val Leu He Tyr Ala Gly Tyr Leu Ala Met Asp Tyr Trp Gly Gin 
100 105 no 

Gly Thr Leu Val Thr Val Ser Ser 
115 120 



15 



(2) INFORMATION FOR SEQ ID NO: 86: 

20 

(i) SEQUENCE CHARACTERISTICS: 

(A) -LENGTH: 80 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
25 (D) TOPOLOGY: linear 

(ii> MOLECULE TYPE: other nucleic acid 



30 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: 
GGCCGCGCCA CAATGCTGGC AGACTCAAGT AAGAACCAGG CCAGCCTGAG ACTCAGCAGC 60 

35 

GTGACAGCCG CCGACACCGC 

B0 
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(2) INFORMATION FOR SEQ ID NO: 87: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 74 base pairs 
5 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

10 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 

15 

GGTGTCGGCG GCTGTCACGC TGCTGAGTCT CAGGCTGGCC TGGTTCTTAC TTGAGTCTGC 60 



W 20 (2) INFORMATION FOR SEQ ID NO: BB : 



1== 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 0 base pairs 

(B) TYPE: nucleic acid 
25 (C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 



tii) MOLECULE TYPE: other nucleic acid 



30 



74 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 
35 GAGGTGCAGC TGCAGCAGAG CGGTCCAGGT CTCGTACGGC CTAGCCAGAC CCTGAGCCTC 60 
ACGTGCACCG CATCTGGCTT CAACATTAAG GACAATTACA TGCACTGGGT G AGACAG CCA 120 
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CCTGGACGAG GCCTTGAGTG GATTGGATGG ATTGACCCTG AGAATGGTGA CACTGAGTAC 
GCACCTAAGT TTCGCGGCCG CGCCACAATG CTGGCAGACT CAAGTAAGAA CCAGGCCAGC 
5 CTGAGACTCA GCAGCGTGAC AGCCGCCGAC ACCGCGGTCT ATTATTGTCA CGTCCTGATA 
TACGCCGGGT ATCTGGCAAT GGACTACTGG GGCCAAGGGA CCCTCGTCAC CGTGAGCTCG 



10 (2) INFORMATION FOR SEQ ID NO: 89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 120 amino acids 

(B) TYPE: amino acid 

15 (C) STRANDEDNESS : single 

(P) TOPOLOGY: linear 



<ii) MOLECULE TYPE: protein 

20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 



25 



Glu Val Gin Leu Gin Gin Ser Gly Pro Gly Leu Val Arg *™ Ser Gin 
15 10 15 



Thr Leu Ser Leu Thr Cys Thr Ala 
20 

30 

Tyr Met His Trp Val Arg Gin Pro 
35 40 

Gly Trp He Asp Pro Glu Asn Gly 
35 50 S5 

Arg Gly Arg Ala Thr Met Leu Ala 
65 70 



Ser Gly Phe Asn He Lys Asp Asn 
25 30 

Pro Gly Arg Gly Leu Glu Trp He 
45 

Asp Thr Glu Tyr Ala Pro Lys Phe 
60 

Asp Ser Ser Lys Asn Gin Ala Ser 

75 80 




WO — FC1VGB97M1165 
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Leu Arg Leu Ser Ser Val Thr Ala Ala Asp Thr Ala Val Tyr Tyr Cys 
85 90 95 



5 His Val 



Leu lie Tyr Ala Gly Tyr Leu Ala Met Asp Tyr Trp Gly Gin 
10 ° 105 110 



□ 



in 



Gly Thr Leu Val Thr Val Ser Ser 
115 120 

10 

(2) INFORMATION FOR SEQ ID NO: 90: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 360 base pairs 
15 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



EE 

S3 20 



Q 



(ii) MOLECULE TYPE: other nucleic acid 



25 



(Xi) SEQUENCE DESCRIPTION : SEQ ID NO: 90: 
GAGGTGCAGC TGCAGCAGAG CGGTCCAGGT CTCGTACGGC CTAGCCAGAC CCTGAGCCTC 60 
ACGTGCACCG CATCTGGCTT CAACATTAAG GACAATTACA TG CACTGGGT GAGACAGCCA 120 
30 CCTGGACGAG GCCTTGAGTG GATTGCATGG ATTGACCCTG AGAATGGTGA CACTGAGTAC 1 
GCACCTAAGT TTCGCGGCCG CGCCACAATG CTGGCAGACT CAAGTAAGAA CCAGGCCAGC 
CTGAGACTCA GCAGCGTGAC AGCCGCCGAC ACCGCGGTCT ATTATTGTCA CGTCCTGATA 



B0 



240 



300 



35 

TACGCCGGGT ATCTGG CAAT GGACTACTGG GGCCAAGGGA CCCTCGTCAC CGTGAGCTCG 360 
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(2) INFORMATION FOR SEQ ID NO: 91: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 120 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91: 

Glu Val Gin Leu Gin Gin Ser Gly Pro Gly Leu Val Arg Pro Ser Gin 

Thr Leu Ser Leu Thr Cys Thr Ala Ser Gly Phe Asn He Lys Asp Asn 
20 25 30 

Tyr Met His Trp Val Arg Gin Pro Pro Gly Arg Gly Leu Glu Trp He 
35 40 45 

Ala Trp He Asp Pro Glu Asn Gly Asp Thr Glu Tyr Ala Pro Lys Phe 
50 55 60 



Arg Gly Arg Ala Thr Met Leu Ala Asp Ser Ser Lys Asn Gin Ala Ser 

80 



65 7 o 75 



Leu Arg Leu Ser Ser Val Thr Ala Ala Asp Thr Ala Val Tyr Tyr Cys 
85 90 95 

His Val Leu He Tyr Ala Gly Tyr Leu Ala Met Asp Tyr Trp Gly Gin 

100 105 no 



Gly Thr Leu Val Thr Val Ser Ser 
US 120 



WO 97/42329 



-154- 

(2) INFORMATION FOR SEQ ID NO: 92: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 760 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS .- single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



60 



120 



(Xi) SEQUENCE DESCRIPTION : SEQ ID NO: 92: 

15 

ATGAAGTTGT GGCTGAACTG GATTTTCCTT GTAACACTTT TAAATGGAAT TCAGTGTGAG 
GTGCAGCTGC AGCAGAGCGG TCCAGGTCTC GTACGGCCTA GCCAGACCCT GAGCCTCACG 
20 TGCACCGCAT CTGGCTTCAA CATTAAGGAC AATTACATGC ACTGGGTGAG ACAGCCACCT 180 
GGACGAGGCC TTGAGTGGAT TGGATGGATT GACCCTGAGA ATGGTGACAC TGAGTACGCA 240 
CCTAAGTTTC GCGGCCGCGT GACAATGCTG GCAGACACTA GTAAGAACCA GTTCAGCCTG 300 

25 

AGACTCAGCA GCGTGACAGC CGCCGACACC GCGGTCTATT ATTGTCACGT CCTGATATAC 360 



420 



GCCGGGTATC TGG CAATGGA CTACTGGGGC CAAGGGACCC TCGTCACCGT GAGCTCGGCC 
30 TCCACCAAGG GCCCATCGGT CTTCCCCCTG GCACCCTCCT CCAAGAGCAC CTCTGGGGGC 48 
ACAGCGGCCC TGGGCTGCCT GGTCAAGGAC TACTTCCCCG AACCGGTGAC GGTGTCGTGG 
AACTCAGGCG CCCTGACCAG CGGCGTGCAC ACCTTCCCGG CTGTCCTACA GTCCTCAGGA 
CTCTACTCCC TCAGCAGCGT GGTGACTGTG CCCTCCAGCA GCTTGGGCAC CCAGACCTAC 
ATCTG CAACG TGAATCACAA CCCCAGCAAC ACCAAGGTCG ACAAGAAAGT TGAGCCCAAA 



540 



600 



660 



720 
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TCTTGTGACA AGACGCACAC GTGCCCGCCG TGCCCGGCTC CGGAACTGCT GGGTGGCCCG 780 

5 (2) INFORMATION FOR SEQ ID NO: 93: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 260 amino acids 

(B) TYPE : amino acid 

10 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

15 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: 



^ 20 Met Lys Leu Trp Leu Asn Trp lie Phe Leu Val Thr Leu Leu Asn Gly 

ru 1 5 io 15 



fees? 



25 



lie Gin Cys Glu Val Gin Leu Gin Gin Ser Gly Pro Gly Leu Val Arg 
20 25 30 

Pro Ser Gin Thr Leu Ser Leu Thr Cys Thr Ala Ser Gly Phe Asn lie 

35 40 45 



Lys Asp Asn Tyr Met His Trp Val Arg Gin Pro Pro Gly Arg Gly Leu 
30 50 . SS 60 



Glu Trp lie Gly Trp lie Asp Pro Glu Asn Gly Asp Thr Glu Tyr Ala 

80 



65 70 75 



35 Pro Lys Phe Arg Gly Arg Val Thr Met Leu Ala Asp Thr Ser Lys Asn 

90 95 
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Gln Phe Ser Leu Arg Leu Ser Ser Val Thr Ala Ala Asp Thr Ala Val 
100 !°S 110 

Tyr Tyr Cys Hie Val Leu lie Tyr Ala Gly Tyr Leu Ala Met Asp Tyr 
115 120 125 

Trp Gly Gin Gly Thr Leu Val Thr Val Ser Ser Ala Ser Thr Lys Gly 
130 "5 140 



Pro Ser Val Phe Pro Leu Ala Pro Ser Ser 



Lys Ser Thr Ser Gly Gly 



145 15 ° 155 leo 



Thr Ala Ala Leu Gly Cys Leu Val Lys Asp Tyr Phe Pro Glu Pro Val 
165 170 i7 5 

Thr Val Ser Trp Asn Ser Gly Ala Leu Thr Ser Gly Val His Thr Phe 
180 185 190 

Pro Ala Val Leu Gin Ser Ser Gly Leu Tyr Ser Leu Ser Ser Val Val 
195 200 205 

Thr Val Pro Ser Ser Ser Leu Gly Thr Gin Thr Tyr He Cys Asn Val 
210 215 220 

Asn His Asn Pro Ser Asn Thr Lys Val Asp Lys Lys Val Glu Pro Lys 
225 230 235 * 240 

Ser Cys Asp Lys Thr His Thr Cys Pro Pro Cys Pro Ala Pro Glu Leu 
245 250 255 

Leu Gly Gly Pro 
260 

(2) INFORMATION FOR SEQ ID NO: 94: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 918 base pai 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94: 

10 

ATGAAGTTGT GGCTGAACTG GATTTTCCTT GTAACACTTT TAAATGGAAT TCAGTGTGAG 60 

GTGCAGCTGC AGCAGAGCGG TCCAGGTCTC GTACGGCCTA GCCAGACCCT GAGCCTCACG 120 

15 TGCACCG CAT CTGGCTTCAA CATTAAGGAC AATTACATGC ACTGGGTGAG ACAGCCACCT 180 

GGACGAGGCC TTGAGTGGAT TGGATGGATT GACCCTGAGA ATGGTGACAC TGAGTACGCA 240 

CCTAAGTTTC GCGGCCGCGT GACAATGCTG GCAGACACTA GTAAGAACCA GTTCAG CCTG 300 

20 

AGACTCAGCA GCGTGACAGC CGCCGACACC GCGGTCTATT ATTGTCACGT CCTGATATAC 360 

/ 

GCCGGGTATC TGGCAATGGA CTACTGGGGC CAAGGGACCC TCGTCACCGT GAG CTCGGCT 420 

25 AGCACCAAGG GCCCATCGGT CTTCCCCCTG GCGCCCTGCT CCAGGAGCAC CTCTGGGGGC 4 SO 

ACAGCGGCCC TGGGCTGCCT GGTCAAGGAC TACTTCCCCG AACCGGTGAC GGTGTCGTGG 540 

AACTCAGGCG CCCTGACCAG CGGCGTGCAC ACCTTCCCGG CTGTCCTACA GTCCTCAGGA 600 

30 

CTCTACTCCC TCAGCAGCGT GGTGACCGTG CCCTCCAGCA GCTTGGGCAC CCAGACCTAC 660 

ACCTGCAACG TGAATCACAA GCCCAGCAAC ACCAAGGTGG ACAAGAGAGT GGAGCTGAAA 720 

35 ACCCCACTCG GTGACACAAC TCACACGTGC CCTAGGTGTC CTGAACCTAA ATCTTGTGAC 780 

ACACCTCCCC CGTGCCCACG GTGCCCAGAG CCCAAATCTT GCGACACGCC CCCACCGTGT 840 
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CCCAGATGTC CTGAACCAAA GAGCTGTGAC ACTCCACCGC CCTGCCCGAG GTGCCCAGCA 900 
CCTGAACTCC TGGGAGGG 918 
5 (2) INFORMATION FOR SEQ ID NO: 95: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 306 amino acids 

(B) TYPE: amino acid 

10 (C) STRANDEDNESS : single 

(D> TOPOLOGY: linear 

(ix) MOLECULE TYPE: protein 

15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95: 

Met Lys Leu Trp Leu Asn Trp lie Phe Leu Val Thr Leu Leu Asn Gly 

1 ^ 5 10 15 

lie Gin Cys Glu Val Gin Leu Gin Gin Ser Gly Pro Gly Leu Val Arg 

20 25 30 

Pro Ser Gin Thr Leu Ser Leu Thr Cys Thr Ala Ser Gly Phe Asn lie 

35 40 45 



20 



25 



30 Lys Asp Asn Tyr Met His Trp Val Arg Gin Pro Pro Gly Arg Gly Leu 

50 55 60 



35 



Glu Trp He Gly Trp lie Asp Pro Glu Asn Gly Asp Thr Glu Tyr Ala 
65 70 75 80 



Lys Phe Arg Gly Arg Val Thr Met Leu Ala Asp Thr S r Lys Asn 
85 90 95 
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Gln Phe Ser Leu Arg Leu Ser Ser Val Thr Ala Ala Asp Thr Ala Val 
100 10S 110 



Tyr Tyr Cys His Val Leu He Tyr Ala Gly Tyr Leu Ala Met Asp Tyr 

125 



US 120 



Trp Gly Gin Gly 
130 

0 

Pro Ser Val Phe 
145 

Thr Ala Ala Leu 

5 

Thr Val Ser Trp 
ISO 

,0 Pro Ala Val Leu 

195 

Thr Val Pro Ser 
210 

.5 

Asn His Lys Pro 
225 

Thr Pro Leu Gly 

0 

Lys Ser Cys Asp 
260 

>5 ser Cys Asp Thr 

275 



Thr Leu Val Thr 
135 

Pro Leu Ala Pro 
150 

Gly Cys Leu Val 
165 

Asn Ser Gly Ala 

Gin Ser Ser Gly 
200 

Ser Ser Leu Gly 
215 

Ser Asn Thr Lys 
230 

Asp Thr Thr His 
245 

Thr Pro- Pro Pro 

Pro Pro Pro Cys 
280 



Val Ser Ser Ala 
140 

Cys Ser Arg Ser 
155 

Lys Asp Tyr Phe 
170 

Leu Thr Ser Gly 
185 

Leu Tyr Ser Leu 

Thr Gin Thr Tyr 
220 

Val Asp Lys Arg 
235 

Thr Cys Pro Arg 
250 

Cys Pro Arg Cys 
265 

Pro Arg Cys Pro 



Ser Thr Lys Gly 

Thr Ser Gly Gly 
160 

Pro Glu Pro Val 
175 

Val His Thr Phe 
190 

Ser Ser Val Val 
205 

Thr Cys Asn Val 

Val Glu Leu Lys 
240 

Cys Pro Glu Pro 
255 

Pro Glu Pro Lys 
270 

Glu Pro Lys Ser 
285 



WO 97/42329 
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Cys Asp Thr Pro Pro Pro Cys Pro Arg Cys Pro Ala Pro Glu Leu Leu 
290 295 300 

Gly Gly 
305 

(2) INFORMATION FOR SEQ ID NO: 96: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 705 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



15 <ii) MOLECULE TYPE: other nucleic acid 



10 



20 <xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96: 

ATGGATTTTC AAGTG CAGAT TTTCAGCTTC CTG CTAATCA GTG CTTCAGT CATAATGTCC 60 

CGCGGCCAGA TCGTGCTGAC CCAGAGCCCA AGCAG CCTGA GCGCTAGCGT GGGTGACAGA 120 

25 

GTG AC CAT CA CGTGTAGTGC CAG CTCAAGT GTAACTTACA TGCACTGGTA CCAGCAGAAG 180 

CCAGGTAAGG CTCCAAAGCT GCTGATCTAC AGCACATCCA ACCTGGCTTC TGGTGTGCCA 240 

30 AG CAGATTCT CCGGAAGCGG TAG CGG CACC GACTACACCT TCACCATCAG CAGCCTCCAG 300 

CCAGAGGATA TCGCCACCTA CTACTGCCAG CAGAGGAGTA CTTACCCGCT CACGTTCGGC 360 

CAAGGGACCA AGCTCGAGAT CAAACGGACT GTGGCTGCAC CATCTGTCTT CATCTTCCCG 420 

35 

CCATCTGATG AGCAGTTGAA ATCTGGAACT GCCTCTGTTG TGTGCCTGCT GAATAACTTC 4 60 



TATCCCAGAG AGG CCAAAGT ACAGTGGAAG GTGGATAACG CCCTCCAATC GGGTAACTCC 



540 
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CAGGAGAGTG TCACAGAGCA GGACAGCAAG GACAGCACCT ACAGCCTCAG CAGCACCCTG 600 
ACGCTGAGCA AAGCAGACTA CGAGAAACAC AAAGTCTACG CCTGCGAAGT CACCCATCAG 660 

5 

GGCCTGAGTT CGCCCGTCAC AAAGAGCTTC AACAGGGGAG AGTGT 705 
(2) INFORMATION FOR SEQ ID NO: 97: 

10 ti) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

15 

(ii) MOLECULE TYPE: protein 



20 



25 



30 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97: 

Met Asp Phe Gin Val Gin He Phe Ser Phe Leu Leu He Ser Ala Ser 
1 5 io 15 

Val He Met Ser Arg Gly Gin He Val Leu Thr Gin Ser Pro Ser Ser 
20 25 30 

Leu Ser Ala Ser Val Gly Asp Arg Val Thr He Thr Cys Ser Ala Ser 
35 40 45 

Ser Ser- Val Thr Tyr Met His Trp Tyr Gin Gin Lys Pro Gly Lys Ala 
50 55 go 

Pro Lys Leu Leu He Tyr Ser Thr Ser Asn Leu Ala Ser Gly Val Pro 
65 70 75 eo 



^0 



5 



10 



15 



20 



25 
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Ser Arg Phe Ser Gly Ser Gly Ser Gly Thr Asp Tyr Thr Phe Thr lie 
85 90 95 

Ser Ser Leu Gin Pro Glu Asp lie Ala Thr Tyr Tyr Cys Gin Gin Arg 
100 105 110 

Ser Thr Tyr Pro Leu Thr Phe Gly Gin Gly Thr Lys Leu Glu He Lys 
115 120 125 

Arg Thr Val Ala Ala Pro Ser Val Phe He Phe Pro Pro Ser Asp Glu 
130 135 140 

Gin Leu Lys Ser Gly Thr Ala Ser Val Val Cys Leu Leu Asn Asn Phe 
145 150 155 160 

Tyr Pro Arg Glu Ala Lys Val Gin Trp Lys Val Asp Asn Ala Leu Gin 
I 65 170 175 

Ser Gly Asn Ser Gin Glu Ser Val Thr Glu Gin Asp Ser Lys Asp Ser 
180 185 190 

Thr Tyr- Ser Leu Ser Ser Thr Leu Thr Leu Ser Lys Ala Asp Tyr Glu 
195 200 205 

Lys His Lys Val Tyr Ala Cys Glu Val Thr His Gin Gly Leu Ser Ser 
210 215 220 



Pro Val Thr Lys Ser Phe Asn Arg Gly Glu Cys 
225 230 235 

30 

(2) INFORMATION FOR SEQ ID NO: 98: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 705 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: other nucleic acid 



• ; r 



25 

CAGGAGAGTG TCACAGAGCA GGACAGCAAG GACAGCACCT ACAG CCTCAG CAGCACCCTG 

ACGCTGAGCA AAGCAGACTA CGAGAAACAC AAAGTCTACG CCTGCGAAGT CACCCATCAG 

30 GGCCTGAGTT CGCCCGTCAC AAAGAGCTTC AACAGGGGAG AGTGT 

(2) INFORMATION FOR SEQ ID NO: 99: 

(i) SEQUENCE CHARACTERISTICS: 
^ 5 (A) LENGTH: 235 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



eo 



5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98: 
ATGGATTTTC AAGTGCAGAT TTTCAGCTTC CTGCTAATCA GTGCTTCAGT CATAATGTCC 
10 CGCGGCGACA TCCAGATGAC CCAGAGCCCA AGCAGCCTGA GCGCTAGCGT GGGTGACAGA i 2 0 
GTGACCATCA CGTGTAGTGC CAGCTCAAGT GTAACTTACA TGCACTGGTA CCAGCAGAAG 
CCAGGTAAGG CTCCAAAGCT GTGGATCTAC AGCACATCCA ACCTGGCTTC TGGTGTGCCA 
p AGCAGATTCT CCGGAAGCGG TAGCGGCACC GACTACACCT TCACCATCAG CAGCCTCCAG 

y i 

S=S; 

" CCAGAGGATA TCGCCACCTA CTACTGCCAG CAGAGGAGTA CTTACCCGCT CACGTTCGGC 

f j 20 CAAGGGACCA AG CTCGAG AT CAAACGGACT GTGGCTGCAC CATCTGTCTT CATCTTCCCG 420 

CCATCTGATG AGCAGTTGAA ATCTGGAACT G CCTCTGTTG TGTGCCTGCT GAATAACTTC 
TATCCCAGAG AGG CCAAAGT ACAGTGGAAG GTGGATAACG CCCTCCAATC GGGTAACTCC 



180 



240 



300 



360 



480 



540 



600 



660 



705 
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(ii) MOLECULE TYPE : protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 



Met Asp Phe Gin 
1 

Val He Met Ser 
20 

Leu Ser Ala Ser 
35 

Ser Ser Val Thr 
50 

Pro Lys Leu Trp 
65 

Ser Arg Phe Ser 

Ser Ser Leu Gin 
100 

Ser Thr Tyr Pro 
115 

Arg Thr Val Ala 
130 

Gin Leu Lys Ser 
145 



Val Gin He Phe 
5 

Arg Gly Asp He 

Val Gly Asp Arg 
40 

Tyr Met His Trp 
55 

He Tyr Ser Thr 
70 

Gly Ser Gly Ser 
85 

Pro Glu Asp He 

Leu Thr Phe Gly 
120 

Ala Pro Ser Val 
135 

Gly Thr Ala Ser 
150 



Ser Phe Leu Leu 
10 

Gin Met Thr Gin 
25 

Val Thr He Thr 

Tyr Gin Gin Lys 
60 

Ser Asn Leu Ala 
75 

Gly Thr Asp Tyr 
90 

Ala Thr Tyr Tyr 
105 

Gin Gly Thr Lys 

Phe He Phe Pro 
140 

Val Val Cys Leu 
155 



He Ser Ala Ser 
15 

Ser Pro Ser Ser 
30 

Cys Ser Ala Ser 
45 

Pro Gly Lys Ala 

Ser Gly Val Pro 
60 

Thr Phe Thr He 
95 

Cys Gin Gin Arg 
110 

Leu Glu He Lys 
125 

Pro Ser Asp Glu 

Leu Asn Asn Phe 
160 



WO 97/42329 
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u 
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hi 
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Tyr Pro Arg Glu AXa Lys Val Gin Trp Lys Val Asp 



Asn Ala Leu Gin 



5 



165 170 175 

Ser Gly Asn Ser Gin Glu Ser Val Thr Glu Gin Asp Ser Lys Asp Ser 
180 185 19Q 

Thr Tyr Ser Leu Ser Ser Thr Leu Thr Leu Ser Lys Ala Asp Tyr Glu 
195 200 205 

10 Lys His Lys Val Tyr Ala Cys Glu Val Thr His Gin Gly Leu Ser Ser 

210 215 220 

Pro Val Thr Lys Ser Phe Asn Arg Gly Glu Cys 
225 230 235 



15 



(2) INFORMATION FOR SEQ ID NO: 100: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 54 base pairs 
20 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(iii MOLECULE TYPE: other nucleic acid 

25 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100: 

30 

CCCAGCACCT GAACTCCTGG GAGGAGCAAC AGGACACAGT TATGAGAAGT 
(2) INFORMATION FOR SEQ ID NO: 101: 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
<ii) MOLECULE TYPE: other nucleic acid 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101: 
10 GGGGGTCTAG ATTATTAGTA CAGGTGTTCC AGGACGTAGC TGGCAACATA 
(2) INFORMATION FOR SEQ ID NO: 102: 



15 



20 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 6 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii> MOLECULE TYPE: other nucleic acid 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102- 

25 

GGGGGAG CTC GGCTAGCACC AAGGGCCCAT CGGTCTTCCC CCTGGC 
(2) INFORMATION FOR SEQ ID NO: 103: 

30 (i) , SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

35 

(ii) MOLECULE TYPE: other nucleic acid 



50 



46 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103: 
TTGTACTTCT CATAACTGTG TCCTGTTGCT CCTCCCAGGA GTTCAGGTGC TGGGC 
5 (2) INFORMATION FOR SEQ ID NO: 104: 



55 



10 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



P 

-7X3. 

u 

! ft 



15 



(ii) MOLECULE TYPE: other nucleic acid 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104: 
y 20 GCCTGTGCTC AATATTGATG G 



21 



^ (2) INFORMATION FOR SEQ ID NO: 105- 



25 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 



30 



(ii) MOLECULE TYPE: ocher nucleic acid 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105: 
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GGAGAAAGCC ATATCTGCCT G 
(2) INFORMATION FOR SEQ ID NO: 106: 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

10 

<ii) MOLECULE TYPE: other nucleic acid 



15 

<xi) SEQUENCE DESCRIPTION : SEQ ID NO: 106: 
TCGCTATTAC CATGGTGATG CGGTTTTGGC 
20 (2) INFORMATION FOR SEQ ID NO: 107: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: "nucleic acid 
25 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

30 

(Xi) SEQUENCE DESCRIPTION : SEQ ID NO: 107: 
GGCTGGATTC TCAGTGGCGA CTT 

35 

£2) INFORMATION FOR SEQ ID NO: 108: 



WO 97/42329 



169 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: IB base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
5 (D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: other nucleic acid 

10 

(Xi) SEQUENCE DESCRIPTION: SEQ ID* NO: 108 
CACAACAGAG GCAGTTCC 

15 

(2) INFORMATION FOR SEQ ID NO: 109: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



20 



(ii) MOLECULE TYPE : other nucleic acid 



25 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO- 109 

30 

CACCTTCACC ATCAGCAGCC 

(2) INFORMATION FOR SEQ ID NO: 110: 

3$ (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 
<B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110: 



10 GGACCTGCTG CAGAGTCTG 



19 



(2) INFORMATION FOR SEQ ID NO: 111: 



15 

f I 

O 

in 

£3 

N 20 

ru 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 4 7 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111: 



GGCTGCAGGA ATTCTTATTA TAGACGAACC CGG CTATCAA ACTGAGC 



47 



30 



35 



(2) INFORMATION FOR SEQ ID NO: 112: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1870 base pairs 

(B) TYPE: nucleic acid 

, (C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: other nucleic acid 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112: 



K 20 



30 

ACGCCCCCAC CGTGTCCCAG ATGTCCTGAA CCAAAGAGCT GTGACACTCC ACCGCCCTGC 



60 



300 



360 



420 



AAGCTTGCCG CCACCATGAA GTTGTGGCTG AACTGGATTT TCCTTGTAAC ACTTTTAAAT 
5 GGAATTCAGT GTGAGGTGCA GCTGCAGCAG AG CGGTCCAG GTCTCGTACG GCCTAGCCAG 120 
ACCCTGAGCC TCACGTGCAC CGCATCTGGC TTCAACATTA AGGACAATTA CATGCACTGG 1B0 
GTGAGACAGC CACCTGGACG AGGCCTTGAG TGGATTGGAT GGATTGACCC TGAGAATGGT 240 

10 

GACACTGAGT ACGCACCTAA GTTTCGCGGC CGCGTGACAA TGCTGGCAGA CACTAGTAAG 
0 AACCAGTTCA GCCTGAGACT CAGCAGCGTG ACAGCCGCCG ACACCGCGGT CTATTATTGT 

■q 

"J 15 CACGTCCTGA TATACGCCGG GTATCTGGCA ATGGACTACT GGGGCCAAGG GACCCTCGTC 

o 

Jjj ACCGTGAGCT CGGCTAGCAC CAAGGGCCCA TCGGTCTTCC CCCTGGCGCC CTGCTCCAGG 460 

AGCACCTCTG GGGGCACAGC GGCCCTGGGC TGCCTGGTCA AGGACTACTT CCCCGAACCG 
GTGACGGTGT CGTGGAACTC AGGCGCCCTG ACCAGCGGCG TGCACACCTT CCCGGCTGTC 
CTACAGTCCT CAGGACTCTA CTCCCTCAGC AGCGTGGTGA CCGTGCCCTC CAGCAGCTTG 
25 GGCACCCAGA CCTACACCTG CAACGTGAAT CACAAGCCCA GCAACACCAA GGTGGACAAG 
AGAGTGGAGC TGAAAACCCC ACTCGGTGAC ACAACTCACA CGTG CCCTAG GTGTCCTGAA 
CCTAAATCTT GTGACACACC TCCCCCGTGC CCACGGTGCC CAGAGCCCAA ATCTTGCGAC 



540 



660 



720 



780 



840 



900 



960 



CCGAGGTGCC CAGCACCTGA ACTCCTGGGA GG AG CAACAG GACACAGTTA TGAGAAGTAC 
35 AACAAGTGGG AAACGATAGA GGCTTGGACT CAACAAGTCG CCACTGAGAA TCCAGCCCTC 1020 
ATCTCTCGCA GTGTTATCGG AACCACATTT GAGGGACGCG CTATTTACCT CCTGAAGGTT 1080 
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GGCAAAGCTG GACAAAATAA GCCTGCCATT TTCATGGACT GTGGTTTCCA TGCCAGAGAG 1140 
TGGATTTCTC CTGCATTCTG CCAGTGGTTT GTAAGAGAGG CTGTTCGTAC CTATGGACGT 1200 



5 GAGATCCAAG TGACAGAGCT TCTCGACAAG TTAGACTTTT ATGTCCTGCC TGTGCTCAAT 



10 



CTCGGTGAGA ACAATG CTG A GTTGAATGCC CTGG CTAAAG CTACTGTGAA AGAACTTGCC 



■%3 

3 



p TCACTGCACG GCACCAAGTA CACATATGGC CCGGGAGCTA CAACAATCTA TCCTTCTGCT 

H 20 

?! Z 

r l_3 

iy GGGACTTCTA AAGACTGGGC TTATGACCAA GGAATCAGAT ATTCCTTCAC CTTTGAACTT 

£3 

f ~ CGAGATACAG GCAGATATGG CTTTCTCCTT CCAGAATCCC AGATCCGGGC TACCTGCGAG 



25 GAGACCTTCC TGGCAATCAA GTATGTTGCC AG CTACGTCC TGGAACAGCT 



TCTAGAGAGA 

(2) INFORMATION FOR SEQ ID NO: 113 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 613 amino aci-ds 

(B) TYPE: amino acid 
tC) STRANDEDNESS : single 
( D ) TOPOLOGY : 1 inear 



30 



35 



1260 



ATTGATGGCT ACATCTACAC CTGGACCAAG AGCCGATTTT GGAGAAAGAC TCGCTCCACC 1320 



CATACTGGAT CTAG CTG CAT TGGCACAGAC CCCAACAGAA ATTTTGATGC TGGTTGGTGT 13 



80 



GAAATTGGAG CCTCTCGAAA CCCCTGTGAT GAAACTTACT GTGGACCTGC CGCAGAGTCT 1440 



1500 



P GAAAAGGAGA CCAAGGCCCT GGCTGATTTC ATCCGCAACA AACTCTCTTC CATCAAGGCA 

^ 15 TATCTGACAA TCCACTCGTA CTCCCAAATG ATGATCTACC CTTACTCATA TGCTTACAAA 1560 



1620 



16B0 



1740 



1800 



GTACTAATAA 1860 



1870. 



(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113: 

Met Lys Leu Trp Leu Asn Trp He Phe Leu Val Thr Leu Leu Asn Gly 
1 5 10 15 

He Gin Cys Glu Val Gin Leu Gin Gin Ser Gly Pro Gly Leu Val Arg 
20 25 30 

Pro Ser Gin Thr . Leu Ser Leu Thr Cys Thr Ala Ser Gly Phe Asn lie 
35 40 45 

Lys Asp Asn Tyr Met His Trp Val Arg Gin Pro Pro Gly Arg Gly Leu 
50 55 60 



Glu Trp lie Gly Trp He Asp Pro Glu Asn Gly Asp Thr Glu Tyr Ala 
65 70 75 eo 



Pro Lys Phe Arg Gly Arg Val Thr Met Leu Ala Asp Thr Ser Lys Asn 
85 90 95 



Gin Phe Ser Leu Arg Leu Ser Ser Val Thr Ala Ala Asp Thr Ala Val 
100 105 110 



Tyr Tyr Cys His Val Leu He Tyr Ala Gly Tyr Leu Ala Met Asp Tyr 
115 120 125 



Trp Gly Gin Gly Thr Leu Val Thr Val Ser Ser Ala Ser Thr Lys 
130 135 140 



Gly 



Pro Ser Val Phe Pro Leu Ala Pro Cys Ser Arg Ser Thr Ser Gly Gly 
145 150 155 160 



Thr Ala Ala Leu Gly Cys Leu Val Lys Asp Tyr Phe Pro Glu Pro Val 
l g5 170 175 

Thr Val Ser Trp Asn Ser Gly Ala Leu Thr Ser Gly Val His Thr Phe 
"0 IBS 190 
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Pro Ala Val Leu 
195 

Thr Val Pro Ser 
210 

Asn His Lys Pro 
225 

Thr Pro Leu Gly 

Lys Ser Cys Asp 
260 

Ser Cys Asp Thr 
275 

Cys Asp Thr Pro 
290 

Gly Gly Ala Thr 
305 

lie Glu Ala Trp 

Ser Arg Ser Val 
340 



Gin Ser Ser Gly 
200 

Ser Ser Leu Gly 
215 

Ser Asn Thr Lys 
230 

Asp Thr Thr His 
245 

Thr Pro Pro Pro 

Pro Pro Pro Cys 
280 

Pro Pro Cys Pro 
295 

Gly His Ser Tyr 
310 

Thr Gin Gin Val 
325 

lie Gly Thr Thr 



-174- 

Leu Tyr Ser Leu 

Thr Gin Thr Tyr 
220 

Val Asp Lys Arg 
235 

Thr Cys Pro Arg 
250 

Cys Pro Arg Cys 
265 

Pro Arg Cys Pro 



Arg Cys Pro Ala 
300 

Glu Lys Tyr Asn 
315 

Ala Thr Glu Asn 
330 

Phe Glu Gly Arg 
345 



Ser Ser Val Val 
205 

Thr Cys Asn Val 

Val Glu Leu Lys 
24 0 

Cys Pro Glu Pro 
255 

Pro Glu Pro Lys 
270 

Glu Pro Lys Ser 
285 

Pro Glu Leu Leu 

Lys Trp Glu Thr 
320 

Pro Ala Leu He 
335 

Ala He Tyr Leu 
350 



Leu Lys Val Gly Lys Ala Gly Gin Asn Lys Pro Ala He Phe Met Asp 
355 360 365 



Cys Gly Phe His Ala Arg Glu Trp He Ser Pro Ala Phe Cys Gin Trp 
370 375 380 
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Phe Val Arg Glu Ala Val Arg Thr Tyr Gly Arg Glu He Gin Val Thr 
385 390 395 4 oO 

Glu Leu Leu Asp Lys Leu Asp Phe Tyr Val Leu Pro Val Leu Asn He 
405 410 415 

Asp Gly Tyr He Tyr Thr Trp Thr Lys Ser Arg Phe Trp Arg Lys Thr 
420 425 4 3 o 

Arg Ser Thr His Thr Gly Ser Ser Cys He Gly Thr Asp Pro Asn Arg 
435 440 44s 

Asn Phe Asp Ala Gly Trp Cys Glu He Gly Ala Ser Arg Asn Pro Cys 
450 455 460 



Asp Glu Thr Tyr Cys Gly Pro Ala Ala Glu Ser Glu Lys Glu Thr Lys 

4B0 



4S5 470 475 



Ala Leu Ala Asp Phe He Arg Asn Lys Leu Ser Ser He Lys Ala Tyr 
*85 490 495 

Leu Thr He His Ser Tyr Ser Gin Met Met He Tyr Pro Tyr Ser Tyr 
500 505 510 

Ala Tyr Lys Leu Gly Glu Asn Asn Ala Glu Leu Asn Ala Leu Ala Lys 
515 520 525 

Ala Thr Val Lys Glu Leu Ala Ser Leu His Gly Thr Lys Tyr Thr Tyr 
530 535 540 



Gly Pro Gly Ala Thr Thr He Tyr Pro Ser Ala Gly Thr Ser Lys Asp 
545 550 555 560 



Trp Ala Tyr Asp Gin Gly He Arg Tyr Ser Phe Thr Phe Glu Leu Arg 
565 570 575 

Asp Thr Gly Arg Tyr Gly Phe Leu Leu Pro Glu Ser Gin He Arg Ala 

560 585 590 
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Thr Cys Glu Glu Thr Phe Leu Ala XXe Lys Tyr Val Ala Ser Tyr Val 
595 600 605 

Leu Glu His Leu Tyr 
610 

(2) INFORMATION FOR SEQ ID NO: 114: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 96 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

( D) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 



Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 114: 



His His Gly Gly Glu His Phe Glu Gly Glu Lys Val Phe Arg Val Asn 
15 10 15 



Val Glu Asp Glu Asn His lie Asn He He Arg Glu Leu Ala Ser Thr 
20 25 30 

Thr Gin lie Asp Phe Trp Lys Pro Asp Ser Val Thr Gin He Lys Pro 
35 40 45 

His Ser Thr Val Asp Phe Arg Val Lys Ala Glu Asp Thr Val Thr Val 
50 55 60 

Glu Asn Val Leu Lys Gin Asn Glu Leu Gin Tyr Lys Val Leu He Ser 
€5 7 ° 75 80 

Asn Leu Arg Asn Val Val Glu Ala Gin Phe Asp Ser Arg Val Arg Leu 
85 90 95 
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177 



10 



(2) INFORMATION FOR SEQ ID NO: 115: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 520 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: other nucleic acid 



15 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115; 



GAGCTCGGCT AGCACCAAGG GCCCATCGGT CTTCCCCCTG GCGCCCTGCT CCAGGAGCAC €0 
20 CTCTGGGGGC ACAGCGGCCC TGGGCTGCCT GGTCAAGGAC TACTTCCCCG AACCGGTGAC 120 
GGTGTCGTGG AACTCAGGCG CCCTGACCAG CGGCGTGCAC ACCTTCCCGG CTGTCCTACA 180 



25 



GTCCTCAGGA CTCTACTCCC TCAGCAGCGT GGTGACCGTG CCCTCCAGCA G CTTGGGCAC 



CCAGACCTAC ACCTGCAACG TGAATCACAA GCCCAGCAAC ACCAAGGTGG ACAAGAGAGT 



240 



300 



GGAG CTG AAA ACCCCACTCG GTGACACAAC TCACACGTGC CCTAGGTGTC CTGAACCTAA 



360 



30 ATCTTGTGAC ACACCTCCCC CGTGCCCACG GTG CCCAGAG 



CCCAAATCTT GCGACACGCC 420 



CCCACCGTGT CCCAGATGTC CTGAACCAAA GAGCTGTGAC ACTCCACCGC CCTGCCCGAG 4 80 



GTGCCCAGCA CCTGAACTCC TGGGAGGGTA ATAGCCCGGG 



35 



(2) INFORMATION FOR SEQ ID NO: 116: 



520 



(i) SEQUENCE CHARACTERISTICS : 
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(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : s ingle 

(D) TOPOLOGY : linear 

5 

(ii) MOLECULE TYPE: other nucleic acid 



10 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116; 



IJ GTTATTACTC GCTGCCCAAC CAGCCATGGC G n , 

M 15 (2) INFORMATION FOR SEQ ID NO: 117: 

C3 

Lfl SEQUENCE CHARACTERISTICS: 

. m 



(A) LENGTH: 23 base pairs 

Q (B> TYPE: nucleic acid 

20 (C) STRANDEDNESS: single 

<D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: other nucleic acid 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 117 : 

30 GCAGCAGGAT AGATTGTTGT AGC 

(2) INFORMATION FOR SEQ ID NO: 118: 

(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 88 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: ocher nucleic acid 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 118: 
CCGGAATTCT TATTAGTTCA GGTCCTCCTC AGAGATCAGC TTCTGCTCCT CGAACTCATG 60 

10 

GTGGTGATGG TGGTGGTACA GGTGTTCC 



(2) INFORMATION FOR SEQ ID NO: 119 s 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



20 



(ii) MOLECULE TYPE: other nucleic acid 



25 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119: 
CAATCTATCC TGCTGCTGGG ACTTCTAAAG 
30 (2) INFORMATION FOR SEQ ID NO: 120: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2 0 base pairs 
35 (B) TYPE: nucleic acid 

<C) STRANDEDNESS: single 
(D) TOPOLOGY : linear 



88 



30 
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(ii) MOLECULE TYPE: ther nucleic acid 



5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 120 
GATTGTTGTA GCTCCCGGGC 
10 (2) INFORMATION FOR SEQ ID NO: 121: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 
15 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 121 

25 GGAGCTACAA CAATCTATCC TTCTGCTGGG 

(2) INFORMATION FOR SEQ ID NO: 122: 

(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 22 base pairs 



(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



35 (ii) MOLECULE 



TYPE: other nucleic acid 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122 
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ACGGCACCAA GTACACATAT GG 

<2) INFORMATION FOR SEQ ID NO: 123: 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

10 

(ii) MOLECULE TYPE: other nucleic acid 



60 



90 



15 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 123: 
ACGAGAATTC GACCG CTCTG CTGCAGCTGC ACCTCGGAAC CGCCACCGCT GCCACCGCCA 
20 GAACCGCCAC CGTACAGGTG TTCCAGGACG 

(2) INFORMATION FOR SEQ ID NO: 124: 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 2154 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

30 (ii) MOLECULE TYPE: other nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 124: 
35 ATGTTGGCAC TCTTGGTTCT GGTGACTGTG GCCCTGGCAT CTGCTCATCA TGGTGGTGAG 60 
CACTTTGAAG GCGAGAAGGT GTTCCGTGTT AACGTTGAAG ATGAAAATCA CATTAACATA 120 
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ATCCGCGAGT TGGCCAGCAC GACCCAGATT GACTTCTGGA AGCCAGATTC TGTCACACAA 
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180 



240 



300 



ATCAAACCTC ACAGTACAGT TGACTTCCGT GTTAAAGCAG AAGATACTGT CACTGTGGAG 
5 AATGTTCTAA AGCAGAATGA ACTACAATAC AAGGTACTGA TAAGCAACCT GAGAAATGTG 

GTGGAGGCTC AGTTTGATAG CCGGGTTCGT GCAACAGGAC ACAGTTATGA GAAGTACAAC 360 
AAGTGGGAAA CGATAGAGGC TTGGACTCAA CAAGTCGCCA CTGAGAATCC AGCCCTCATC 420 

10 

TCTCGCAGTG TTATCGGAAC CACATTTGAG GGACGCGCTA TTTACCTCCT GAAGGTTGGC 480 



540 



600 



660 



AAAGCTGGAC AAAATAAGCC TGCCATTTTC ATGGACTGTG GTTTCCATGC CAGAGAGTGG 
15 ATTTCTCCTG CATTCTGCCA GTGGTTTGTA AGAGAGGCTG TTCGTACCTA TGGACGTGAG 
ATCCAAGTGA CAGAGCTTCT CGACAAGTTA GACTTTTATG TCCTGCCTGT GCTCAATATT 

GATGGCTACA TCTACACCTG GACCAAGAGC CGATTTTGGA GAAAGACTCG CTCCACCCAT 720 

20 

ACTGGATCTA GCTGCATTGG CACAGACCCC AACAGAAATT TTGATGCTGG TTGGTGTGAA 780 

ATTGGAGCCT CTCGAAACCC CTGTGATGAA ACTTACTGTG GACCTGCCGC AGAGTCTGAA 840 

25 AAGGAGACCA AGGCCCTGGC TGATTTCATC CGCAACAAAC TCTCTTCCAT CAAGGCATAT 900 

CTGACAATCC ACTCGTACTC CCAAATGATG ATCTACCCTT ACTCATATGC TTACAAACTC 960 

GGTGAGAACA ATGCTGAGTT GAATGCCCTG GCTAAAGCTA CTGTGAAAGA ACTTGCCTCA 1020 

CTGCACGGCA CCAAGTACAC ATATGGCCCG GGAGCTACAA CAATCTATCC TTCTGCTGGG 1080 

ACTTCTAAAG ACTGGGCTTA TGACCAAGGA ATCAGATATT CCTTCACCTT TGAACTTCGA 1140 

35 GATACAGGCA GATATGGCTT TCTCCTTCCA GAATCCCAGA TCCGGGCTAC CTGCGAGGAG 1200 

ACCTTCCTGG CAATCAAGTA TGTTGCCAGC TACGTCCTGG AACACCTGTA CGGTGGCGGT 1260 
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TCTGGCGGTG GCAGCGGTGG CGGTTCCGAG GTGCAGCTGC AGCAGAGCGG TCCAGGTCTC 1320 

GTACGGCCTA GCCAGACCCT GAGCCTCACG TGCACCGCAT CTGGCTTCAA CATTAAGGAC 1380 

5 AATTACATGC ACTGGGTGAG ACAGCCACCT GGAOGAGGCC TTGAGTGGAT TGGATGGATT 1440 

GACCCTGAGA ATGGTGACAC TGAGTACGCA CCTAAGTTTC GCGGCCGCGT GACAATGCTG 1500 



10 



GCAGACACTA GTAAGAACCA GTTCAGCCTG AGACTCAGCA GCGTGACAGC CGCCGACACC 1560 
GCGGTCTATT ATTGT CACGT CCTGATATAC GCCGGGTATC TGGCAATGGA CTACTGGGGC 1620 
CAAGGGACCC TCGTCACCGT GAGCTCGGCT AGCACCAAGG GCCCATCGGT CTTCCCCCTG 1680 



15 GCGCCCTGCT CCAGGAG CAC CTCTGGGGGC ACAGCGGCCC TGGGCTGCCT 



GGTCAAGGAC 1740 



TACTTCCCCG AACCGGTGAC GGTGTCGTGG AACTCAGGCG CCCTGACCAG CGGCGTGCAC 1800 



20 



ACCTTCCCGG CTGTCCTACA GTCCTCAGGA CTCTACTCCC TCAGCAGCGT GGTGACCGTG I860 



CCCTCCAGCA GCTTGGGCAC CCAGACCTAC ACCTGCAACG TGAATCACAA GCCCAGCAAC 



1920 



ACCAAGGTGG ACAAGAGAGT GG AG CTGAAA ACCCCACTCG GTGACACAAC TCACACGTGC 1980 



25 CCTAGGTGTC CTGAACCTAA ATCTTGTGAC ACACCTCCCC CGTGCCCACG 



GTGCCCAGAG 2040 



CCCAAATCTT GCGACACGCC CCCACCGTGT CCCAGATGTC CTGAACCAAA GAG CTGTGAC 



2100 



ACTCCACCGC CCTGCCCGAG GTGCCCAGCA CCTGAACTCC TGGGAGGGTA ATAG 



30 



(2) INFORMATION FOR SEQ ID NO: 125: 



2154 



35 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 716 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



i§9 
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(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 125: 

Met Leu Ala Leu Leu Val Leu Val Thr Val Ala Leu Ala Ser Ala His 
1 5 10 15 

His Gly Gly Glu His Phe Glu Gly Glu Lys Val Phe Arg Val Asn Val 
20 25 30 

Glu Asp Glu Asn His lie Asn lie lie Arg Glu Leu Ala Ser Thr Thr 
35 40 45 

Gin He Asp Phe Trp Lys Pro Asp Ser Val Thr Gin He Lys Pro His 
50 55 eo 

Ser Thr Val Asp Phe Arg Val Lys Ala Glu Asp Thr Val Thr Val Glu 
65 70 75 80 

Asn Val Leu Lys Gin Asn Glu Leu Gin Tyr Lys Val Leu He Ser Asn 
65 90 95 

Leu Arg Asn Val Val Glu Ala Gin Phe Asp Ser Arg Val Arg Ala Thr 
100 105 no 

Gly His Ser Tyr Glu Lys Tyr Asn Lys Trp Glu Thr H e Glu Ala Trp 
115 120 125 



Thr Gin Gin Val Ala Thr Glu Asn Pro Ala Leu He Ser Arg Ser Val 
130 135 i 4 o 

He Gly Thr Thr Phe Glu Gly Arg Ala He Tyr Leu Leu Lys Val Gly 
145 150 155 i 6 o 
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Lys Ala Gly Gin Asn Lys Pro Ala lie Phe Met Asp Cys Gly Phe His 
165 170 1?5 



Ala Arg Glu Trp lie Ser Pro Ala Phe 



Cys Gin Trp Phe Val Arg Glu 



160 «S 190 

Ala Val Arg Thr Tyr Gly Arg Glu He Gin Val Thr Glu Leu Leu Asp 
195 200 205 

Lys Leu Asp Phe Tyr Val Leu Pro Val Leu Asn He Asp Gly Tyr He 
210 215 220 



Tyr Thr Trp Thr Lys Ser Arg Phe Trp Arg Lys Thr Arg Ser Thr His 

240 



225 230 235 



Thr Gly Ser Ser Cys He Gly Thr Asp Pro Asn Arg Asn Phe Asp Ala 
245 250 255 

Gly Trp Cys Glu He Gly Ala Ser Arg Asn Pro Cys Asp Glu Thr Tyr 
260 265 270 

Cys Gly P "ro Ala Ala Glu Ser Glu Lys Glu Thr Lys Ala Leu Ala Asp 
275 280 285 



Phe He Arg Asn Lys Leu Ser Ser He Lys Ala Tyr Leu Thr II 

300 



e His 

290 295 



Ser Tyr Ser Gin Met Met He Tyr Pro Tyr 

315 320 



Tyr Ser Tyr Ala Tyr Lys Leu 

305 310 



Gly Glu Asn Asn Ala Glu Leu Asn Ala Leu Ala Lys Ala Thr Val Lys 
325 ' 330 • 335 



Glu Leu Ala Ser Leu His Gly Thr Lys Tyr Thr Tyr Gly Pro Gly Al 
340 345 350 



Thr Thr He Tyr Pro Ser Ala Gly Thr Ser Lys Asp Trp Ala Tyr Asp 



355 



360 



365 
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Gln Gly lie Arg Tyr Ser Phe Thr Phe Glu Leu Arg Asp Thr Gly Arg 
370 375 380 



Tyr Gly Phe Leu Leu Pro Glu Ser Gin lie Arg Ala Thr Cys Glu Glu 
3flS 390 395 400 



Thr Phe Leu Ala lie Lys Tyr Val Ala Ser Tyr Val Leu Glu His Leu 

415 



405 410 



Tyr Gly Gly Gly Ser Gly Gly Gly Ser Gly Gly Gly Ser Glu Val Gin 

430 



«0 425 



Leu Gin Gin Ser Gly Pro Gly Leu Val Arg Pro Ser Gin Thr Leu Ser 
435 440 445 



Leu Thr Cys Thr Ala Ser Gly Phe Asn lie Lys Asp Asn Tyr Met His 

460 



450 455 



Trp Val Arg Gin Pro Pro Gly Arg Gly Leu Glu Trp He Gly Trp He 

475 480 



4S5 470 



Asp Pro Glu Asn Gly Asp Thr Glu Tyr Ala Pro Lys Phe Arg Gly Arg 

495 



485 490 



Val Thr Met Leu Ala Asp Thr Ser Lys Asn Gin Phe Ser Leu Arg Leu 
500 505 510 



Ser Ser Val Thr Ala Ala Asp Thr Ala Val Tyr Tyr Cys His Val Leu 

525 



515 520 



He Tyr Ala Gly Tyr Leu Ala Met Asp Ty r Trp Gly Qln Qly Thr Leu 
530 «5 540 



Val Thr Val Ser Ser Ala Ser Thr Lys Gly Pro Ser Val Phe Pro Leu 
54 S 550 



555 



560 
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Ala Pr Cys Ser Arg Ser Thr Ser Gly Gly Thr Ala Ala Leu Gly Cys 
565 570 575 

Leu Val Lys Asp Tyr Phe Pro Glu Pro Val Thr Val Ser Trp Asn Ser 
580 585 590 

Gly Ala Leu Thr Ser Gly Val His Thr Phe Pro Ala Val Leu Gin Ser 
595 600 6 05 

Ser Gly Leu Tyr Ser Leu Ser Ser Val Val Thr Val Pro Ser Ser Ser 
610 615 6 20 

Leu Gly Thr Gin Thr Tyr Thr Cys Asn Val Asn His Lys Pro Ser Asn 
625 «0 635 640 

Thr Lys Val Asp Lys Arg Val Glu Leu Lys Thr Pro Leu Gly Asp Thr 
645 650 655 

Thr His Thr Cys Pro Arg Cys Pro Glu Pro Lys Ser Cys Asp Thr Pro 
660 665 670 

Pro Pro Cys Pro Arg Cys Pro Glu Pro Lys Ser Cys Asp Thr Pro Pro 
675 680 685 

Pro Cys Pro Arg Cys Pro Glu Pro Lys Ser Cys Asp Thr Pro Pro Pro 
690 695 700 

Cys Pro Arg Cys Pro Ala Pro Glu Leu Leu Gly Gly 
7 °5 710 7X5 

(2) INFORMATION FOR SEQ ID NO: 126: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 



WO 97/42329 





PCT/GB97/01165 



- 188- 



(ii) MOLECULE TYPE: other nucleic acid 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 126: 
TATATAAAGC TTGCCGCCAC CATGGGCCAC ACACGGAGGC AG 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 127: 
ACTCCACCAG CTTCACCTCG TTATCAGGAA AATGCTCTTG CTTGG 
(2) INFORMATION FOR SEQ ID NO: 128: 
(i) SEQUENCE CHARACTERISTICS: 



5 



10 (2) INFORMATION FOR SEQ ID NO: 127: 



20 



30 



(A) 5 LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



35 



(ii) MOLECULE TYPE: other nucleic acid 
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(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 128: 
AGAGCATTTT CCTGATAACG AGGTGAAGCT GGTGGAGTCT GGAGG 
(2) INFORMATION FOR SEQ ID NO: 129: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 129: 
CCAGGCATCC CAGGGTCACC ATGGAGTTAG TTTGGGCAGC 
(2) INFORMATION FOR SEQ ID NO: 130: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1446 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 16 . .143 5 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 130; 



AAGCTTGCCG CCACC ATG GGC CAC ACA CGG AGG CAG GGA ACA TCA CCA TCC 51 
Met Gly His Thr Arg Arg Gin Gly Thr Ser Pro Ser 

10 



5 



AAG TGT CCA TAC CTC AAT TTC TTT CAG CTC TTG GTG CTG GCT GGT CTT 99 
Lys Cys Pro Tyr Leu Asn Phe Phe Gin Leu Leu Val Leu Ala Gly Leu 
15 20 25 

10 

TCT CAC TTC TGT TCA GGT GTT ATC CAC GTG ACC AAG GAA GTG AAA GAA 147 
Ser His Phe Cys Ser Gly Val He His Val Thr Lys Glu Val Lys Glu 
30 35 40 



ATG TCT GGG GAC ATG AAT ATA TGG CCC GAG TAC AAG AAC CGG ACC ATC 
Met Ser Gly Asp Met Asn lie Trp Pro Glu Tyr Lys Asn Arg Thr He 
25 80 85 90 

TTT GAT ATC ACT AAT AAC CTC TCC ATT GTG ATC CTG GCT CTG CGC CCA 
Phe Asp He Thr Asn Asn Leu Ser He Val He Leu Ala Leu Arg Pro 



30 



95 100 105 



35 GCT TTC AAG CGG GAA CAC CTG GCT GAA GTG ACG TTA TCA GTC AAA GCT 
Ala Phe Lys Arg Glu His Leu Ala Glu Val Thr Leu Ser Val Lys Ala 



125 



130 



195 



15 GTG GCA ACG CTG TCC TGT GGT CAC AAT GTT TCT GTT GAA GAG CTG GCA 
Val Ala Thr Leu Ser Cys Gly His Asn Val Ser Val Glu Glu Leu Ala 
45 50 55 60 

CAA ACT CGC ATC TAC TGG CAA AAG GAG AAG AAA ATG GTG CTG ACT ATG 243 
20 Gin Thr Arg He Tyr Trp Gin Lys Glu Lys Lys Met Val Leu Thr Met 

65 70 75 



291 



339 



TCT GAC GAG GGC ACA TAC GAG TGT GTT GTT CTG AAG TAT GAA AAA GAC 3B7 
Ser Asp Glu Gly Thr Tyr Glu Cys Val Val Leu Lys Tyr Glu Lys Asp 
HO 115 120 



435 



135 140 



ME 
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U 15 190 195 200 



25 GAG CAT TTT CCT GAT AAC GAG GTG AAG CTG GTG GAG TCT GGA GGA GGC 
Glu His Phe Pro Asp Asn Glu Val Lys Leu Val Glu Ser Gly Gly Gly 
240 245 250 



30 TTG GTA CAG CCT GGG GGT TCT CTG AGA CTC TCC TGT GCA ACT TCT GGG 

Leu Val Gin Pro Gly Gly Ser Leu Arg Leu Ser Cys Ala Thr Ser Gly 
255 ' 260 265 

TTC ACC TTC ACT GAT TAC TAC ATG AAC TGG GTC CGC CAG CCT CCA GGA 

35 Phe Thr Phe Thr Asp Tyr Tyr Met Asn Trp Val Arg Gin Pro Pro Gly 
270 275 280 

AAG GCA CTT GAG TGG TTG GGT TTT ATT GGA AAC AAA GCT AAT GGT TAC 
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GAC TTC CCT ACA CCT AGT ATA TCT GAC TTT GAA ATT CCA ACT TCT AAT 483 
Asp Phe Pro Thr Pro Ser He Ser Asp Phe Glu He Pro Thr Ser Asn 
1*5 150 155 

5 ATT AGA AGG ATA ATT TGC TCA ACC TCT GGA GGT TTT CCA GAG CCT CAC 531 
He Arg Arg He He Cys Ser Thr Ser Gly Gly Phe Pro Glu Pro His 
160 165 170 

CTC TCC TGG TTG GAA AAT GGA GAA GAA TTA AAT GCC ATC AAC ACA ACA 579 
10 Leu Ser Trp Leu Glu Asn Gly Glu Glu Leu Asn Ala He Asn Thr Thr 
175 180 185 

GTT TCC CAA GAT CCT GAA ACT GAG CTC TAT GCT GTT AGC AGC AAA CTG 627 
Val Ser Gin Asp Pro Glu Thr Glu Leu Tyr Ala Val Ser Ser Lys Leu 



GAT TTC AAT ATG ACA ACC AAC CAC AGC TTC ATG TGT CTC ATC AAG TAT 675 
Asp Phe Asn Met Thr Thr Asn His Ser Phe Met Cys Leu He Lys Tyr 
205 210 215 220 

20 

GGA CAT TTA AGA GTG AAT CAG ACC TTC AAC TGG AAT ACA ACC AAG CAA 723 
Gly His Leu Arg Val Asn Gin Thr Phe Asn Trp Asn Thr Thr Lys Gin 
225 230 235 



771 



819 



867 



915 
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Lys Ala Leu Glu Trp Leu Gly Phe lie Gly Asn Lys Ala Asn Gly Tyr 
285 290 295 300 

ACA ACA GAG TAC ACT GCA TCT GTG AAG GGT CGG TTC ACC ATC TCC AGA 963 
5 Thr Thr Glu Tyr Ser Ala Ser Val Lys Gly Arg Phe Thr He Ser Arg 
305 310 315 



GAC AAA TCC CAA AGC ATC CTC TAT CTT CAA ATG AAC ACC CTG AGA GCT 1011 
Asp Lys Ser Gin Ser lie Leu Tyr Leu Gin Met Asn Thr Leu Arg Ala 

330 



10 320 32S 



CCT GAG CCA GTG ACA GTG ACC TGG AAC TCT GGA TCT CTG TCC AGC GGT 
Pro Glu Pro Val Thr Val Thr Trp Asn Ser Gly Ser Leu Ser Ser Gly 

405 410 



30 400 



415 420 



35 



AGC TCA GTG ACT GTC CCC TCC AGC ACC TGG CCC AGC GAG ACC GTC ACC 
Ser Ser Val Thr Val Pro Ser Ser Thr Trp Pro Ser Glu Thr Val Thr 
430 «5 440 



1059 



1107 



GAG GAC ACT GCC ACT TAT TAC TGT ACA AGA GAT AGG GGG CTA CGG TTC 
Jjj Glu Asp Ser Ala Thr Tyr Tyr Cys Thr Arg Asp Arg Gly Leu Arg Phe 

<l9 335 340 345 

M" 15 
C3 

p TAC TTT GAC TAC TGG GGC CAA GGC ACC ACT CTC ACA GTC TCC TCA GCC 

j| Tyr Phe Asp Tyr Trp Gly Gin Gly Thr Thr Leu Thr Val Ser Ser Ala 

T 3S ° 355 360 

f j 20 AAA ACG ACA CCC CCA TCT GTC TAT CCA CTG GCC CCT GGA TCT GCT GCC 1155 

y Lys Thr Thr Pro Pro Ser Val Tyr Pro Leu Ala Pro Gly Ser Ala Ala 

CAA ACT AAC TCC ATG GTG ACC CTG GGA TGC CTG GTC AAG GGC TAT TTC 1203 
25 Gin Thr Asn Ser Met Val Thr Leu Gly Cys Leu Val Lys Gly Tyr Phe 
385 390 395 



1251 



GTG CAC ACC TTC CCA GCT GTC CTG CAG TCT GAC CTC TAC ACT CTG AGC 1299 
Val His Thr Phe Pro Ala Val Leu Gin Ser Asp Leu Tyr Thr Leu Ser 

425 



1347 



WO 97/42329 
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TGC AAC GTT GCC CAC CCG GCC AGC AGC ACC AAG GTG GAC AAG AAA ATT 
Cys Asn Val Ala His Pro Ala Ser Ser Thr Lys Val Asp Lys Lys He 

445 450 ice 

455 460 



1395 



GTG CCC AGG GAT TGT GGT TGT AAG CCT TGC ATA TGT ACA T AGTAAGAATT 
Val Pro Arg Asp Cys Gly Cys Lys Pro Cys He Cys Thr 
4S5 470 



1445 



10 c 



1446 



^ (2) INFORMATION FOR SEQ ID NO: 131: 

H; 15 (i) SEQUENCE CHARACTERISTICS: 

Q (A) LENGTH: 4 73 amino acids 

in (B) TYPE: amino acid 

^* (D) TOPOLOGY: linear 

^ 20 (ii) MOLECULE TYPE: protein 

UJ (xi > SEQUENCE DESCRIPTION: SEQ ID NO: 131: 



Met Gly His Thr Arg Arg Gin Gly Thr Ser Pro Ser Lys Cys Pro Tyr 
1 5 1° 15 

25 

Leu Asn Phe Phe Gin Leu Leu Val Leu Ala Gly Leu Ser His Phe Cys 

Ser Gly Val lie His Val Thr Lys Glu Val Lys Glu Val Al 



30 



a Thr Leu 
45 



Ser Cys Gly His Asn Val Ser Val Glu Glu Leu Ala Gin Thr Arg lie 



50 



55 



60 



35 Tyr Trp Gln Lys Glu Lys Lys Met Val m ^ ^ t g ^ ^ 

65 70 

70 75 80 
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Met Asn lie Trp Pro Glu Tyr Lys Asn Arg Thr He Phe Asp He Thr 



85 



90 



95 



Asn Asn Leu Ser He Val He Leu 



Ala Leu Arg Pro Ser Asp Glu Gly 



100 



105 



110 



Thr Tyr Glu Cys Val Val Leu Lys Tyr Glu Lys Asp Ala Phe Lys Arg 



115 



120 



125 



10 Glu His Leu Ala Glu Val Thr 



Leu Ser Val Lys Ala Asp Phe 



Pro Thr 



130 



135 



140 



%P Pro Ser He Ser Asp Phe Glu He 



145 



Pro Thr Ser Asn He Arg Arg He 



150 



15 



155 



160 



fees' 

in 

3 



He Cys Ser Thr Ser Gly Gly Phe Pro Glu Pro His Leu Ser Trp Leu 
165 



170 



175 



TO 20 



Glu Asn Gly Glu Glu Leu Asn Ala He Asn Thr 



160 



Thr Val Ser Gin Asp 



185 



190 



200 



Pro Glu Thr Glu Leu Tyr Ala Val Ser Ser Lys Leu Asp Phe Asn Met 

195 

25 Thr Thr Asn His Se 



205 



3er Phe Met Cys Leu He Lys Tyr Gly 



210 



His Leu Arg 



215 



220 



Val Asn Gin Thr Phe Asn Trp Asn Thr Thr Lys Gin Glu His Phe Pro 

30 225 230 235 

235 240 



Asp Asn Glu Val Lys Leu Val Glu Ser Gly Gly Gly Leu Val Gin Pro 



245 



250 



35 Gly Gly Ser Leu Arg Leu Ser Cys Ala Thr Ser Gly Phe 



260 



265 



255 

Thr Phe Thr 
270 
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Asp Tyr Tyr Met Asn Trp Val Ar 9 Gin Pro Pro Gly Lys Ala Leu Glu 
275 280 285 



Trp Leu Gly Phe lie Gly Asn Lys Ala Asn Gly Tyr Thr Thr Glu Tyr 

300 



5 -290 295 



Ser Ala Ser Val Lys Gly Arg Phe Thr He Ser Arg Asp Lys Ser Gin 
305 3 " 315 320 

10 ser He Leu Tyr Leu Gin Met Asn Thr Leu Arg Ala Glu Asp Ser Ala 



325 330 



335 



O ^ ^ Thr ASP Arg Gly Leu **9 Phe Tyr Phe Asp Tyr 

f 340 345 350 

15 

g Trp Gly Gin Gly Thr Thr Leu Thr Val Ser Ser Ala Lys Thr Thr Pro 

m 355 360 365 

\3 



£._5 



Pro Ser Val Tyr Pro Leu Ala Pro Gly Ser Ala Ala Gin Thr Asn Ser 
•4 20 370 375 3ao 



§ M6t 731 Thr LeU Leu Val ^s Gly Tyr Phe Pro Glu Pro Val 

P 385 390 395 4Q0 

25 Thr Val Thr Trp Asn Ser Gly Ser Leu Ser Ser Gly Val His Thr Phe 
405 "0 415 

Pro Ala Val Leu Gin Ser Asp Leu Tyr Thr Leu Ser Ser Ser Val Thr 
420 «S 430 

30 



Val Pro Ser Ser Thr Trp Pro Ser Glu Thr Val Thr Cys Asn Val 

445 



Ala 

435 44 0 



His Pro Ala Ser Ser Thr Lys Val Asp Lys Lys lie Val Pro Arg Asp 



35 450 455 



Cys Gly Cys Lys Pro Cys He Cys Thr 
4SS 470 



460 



